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(Date de priorite (la plus ancienne) 
( jourfmoisfann ee ) 
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Le present rapport de recherche Internationale, etabli par l'administration chargee de la recherche Internationale, est transmis au 
deposant conformement a 1' article 18. Une copie en est transmise au Bureau international. 



Ce rapport de recherche international comprend 



feuilles. 



f)(l U est aussi accompagne d'une copie de chaque document relatif a l'etat de la technique qui y est cite. 



1 . | | D a etc estime que certaines revendications ne pouvaient pas faire I'objet d'une recherche (voir le cadre I). 

2. | | D y a absence d'unite de (Invention (voir le cadre II). 

3. | X 1 La demande internationale contient la divulgation d'un list age de sequence de nucleotides ou d'acides amines et la 

recherche internationale a ete effectuee sur la base du listage de sequence 

| | depose avec la demande internationale 

| X 1 fourni par le deposant separement de la demande internationale 

| [ sans etre accompagnee d'une declaration selon laquelle it n'inclut pas d'elements 
allant au-dela de la divulgation faite dans la demande internationale telle 
qu'elle a ete deposee. 

| | transcrit par l'administration 



4. En ce qui concerne le titre, | X | le texte est appro uve tel qu'il a ete remise par le deposant. 

| | Le texte a ete etabli par l'administration et a la teneur suivante: 



5. En ce qui concerne l'abrege, 



□ 



le texte est appro uve tel qu'il a ete remis par le deposant 

le texte (reproduit dans le cadre III) a ete etabli par ('administration conformement a la 
regie 38.2b). Le deposant peut presenter des observations a l'administration dans un delai 
d'un mois a compter de la date d'expedition du present rapport de recherche internationale. 



6. La figure des dessins a publier avec l'abrege est la suivante: 
Figure n" | | suggeree par le deposant. 



| | parce que le deposant n'a pas suggere de figure. 
| | parce que cette figure caracterise mieux l'invention. 



Aucune des figures 
n'est a publier. 
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j RAPPORT DE RECHERCHE INTERNATIONALE 


Demande Internationale No 




PCT/FR y//0016/ 



A CLASSEMENT DE L'OBJET DE LA DEMANDE - ^ „ 

CIB 6 C12N15/29 C12N15/82 C07K14/425 A01H5/00 



Scion la classification Int ernationale des brevets (CIB) ou a la fois selon la classification nationale et la CIB 

B. DOMAINES SUR LESQUELS LA RECHERCHE A PORTE 

Documentation mini male consul tee (systeme de classification suivi des symboles de classement) 

CIB 6 C12N C07K A01H 



Documentation consultee autre que la documentation mini male dans la mesure ou ces documents relevent des domaines sur lesquels a porte la recherche 



Base de donnees electronique consultee au cours de la recherche Internationale (nom de la base de donnees, et si cela est realisable, termes de recherche 
utilises) 



C. DOCUMENTS CONSIDERES COM ME PERTINENTS 



Categoric * 


Identification des documents cites, avec, le cas echeant, 1'indication des passages pertinents 


no. des revendications visees 


X 


WO 93 15221 A (HITACHI CHEMICAL CO LTD 
;HITACHI CHEMICAL RESEARCH CENT (US)) 5 
Aout 1993 

voir seq. I .D no. 186 
voir page 46 


1 


X 


WO 93 12230 A (STANFORD RES INST INT) 24 
Juin 1993 

voir page 12, ligne 22; figure 14A 
voir page 65 


2 


X 


EMBL SEQUENCE DATABASE , REL.18, ACCESSION 
NO. M19784, 20-FEB-1989, XP002035349 
FORNEY, J.D., ET AL. : 
voir sequence 

-/" 


2 



)M Voir la suite du cadre C pour la fin de la liste des documents |)( I Les documents de families de brevets sont indiques en annexe 



"T" document ulterieur publie apres la date de depot international ou la 
date de priori tie et n'appartenenant pas a l'etat de la 
technique pertinent, mais cite pour comprendre le principe 
ou la theorie constituant la base de I' invention 

*X" document parti cull e re ment pertinent; 1' invention revendiquee ne peut 
etre consideree comme nouvelle ou comme impliquant une activite 
inventive par rapport au document considere isolement 

*Y" document parti culierement pertinent; l'invention revendiquee 

ne peut etre consideree comme impliquant une activite inventive 
lorsque le document est associe a un ou plusieurs autres 
documents de m2me nature, cette combinaison etant evidente 
pour une personnc du metier 

*&" document qui fait parti e de la meme famille de brevets 



Date a laquelle la recherche intemationale a etc eflectivement achevee 

18 Juillet 1997 


Date d'expedition du present rapport de recherche intemationale 

3 0. 07. 97 


Nom et adresse postal e de 1' administration chargee de la recherche intemationale 
Office Europeen des Brevets, P.B. 5818 Patentlaan 2 
NL - 2280 HV Rijswijk 
Tel. (+31-70) 340-2040, Tx. 31 651 epo nl, 
Fax: ( + 31-70) 340-3016 


Fonctionnaire autorise 

Maddox, A 



Form ul aire PCT/ISA/210 (deuxieme feuille) (juillet 1993} 



° Categories special es de documents cites: 

"A" document de finis sant l'etat general de la technique, non 
considere comme particulierement pertinent 

"E" document anterieur, mais publie a la date de depot international 

ou apres cette date 
*L" document pouvant jeter un doute sur une revendi cation de 

priorite ou cite pour determiner la date de publication d'une 

autre citation ou pour une raison speciale (telle qu'indiquee) 
'O* document se referant a une divulgation orale, a un usage, a 

une exposition ou tous autres moyens 

*P* document publie avant la date de depdt international, mais 
posterieurement a la date de pnonte revendiquee 
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v RAPPORT DE RECHERCHE INTERNATIONALE 



Demande Internationale No 

PCT/FR 97/00167 



C(suite) DOCUMENTS CONSIDERES COMME PERTINENTS 



Categorie * Identification des documents cites, avec, le cas echeant, l'indication des passages pertinents 



no. des re vendi cations visees 



CHEMICAL ABSTRACTS, vol. 104, no. 19, 
1986 

Columbus, Ohio, US; 

abstract no. 168751, 

SCHOTT, H., ET AL. : "Synthesis of 

fragments of the terminal inverted 

repeating units of macronuclear DNA from 

hypotrichous ciliates" 

XP002034741 

voir abrege 

& MAKR0M0L. CHEM. , 

vol. 187, no. 1, 1986, 

pages 81-104, 

EMBL SEQUENCE DATABASE ACCESSION 
N0.R40179, RELEASE 43, 8-05-1995., 
XP002034738 

HILLIER, L., ET AL. : n yf70e09.sl Homo 
sapiens cDNA clone 27461 3' similar to 
contains MER22 repetitive element" 
voir sequence 

GCG-GENESEQ DATABASE ACCESSION NO. R28869, 
23-MAR-1993, XP002034739 
voir sequence 

& WO 92 19248 A (UNIVERSITY OF WASHINGTON) 
12 Novembre 1992 

THE JOURNAL OF BIOLOGICAL CHEMISTRY, 
vol. 270, no. 18, 5 Mai 1995, 
pages 10847-10854, XP002035350 
QI, Z., ET AL.: "Reconstitution of 
neronal cdc2-like kinase from 
bacteria-expressed cdk5 and an active 
fragement of the brain-specific activator" 
voir page 10852; tableau II 

PLANT CELL 6 (12). 1994. 1911-1922., 
XP002017847 

GELI M I ET AL: "Two structural domains 

mediate two sequential events in 

gamma-Zein targeting: Protein endoplasmic 

reticulum retention and protein body 

formation." 

cite dans la demande 

voir le document en entier 

PLANT MOL BIOL 16 (1). 1991. 117-128., 
XP002017848 

OHTANI T ET AL: "NORMAL AND 

LYSINE-CONTAINING ZEINS ARE UNSTABLE IN 

TRANSGENIC TOBACCO SEEDS." 

cite dans la demande 

voir le document en entier 



-/- 



24 



24 



1-41 



1-41 



Formulaire PCT/ISA/210 (suite de la deuxieme feuille) (juillet 1992) 
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Demande Internationale No 

PCT/FR 97/00167 



C.(suite) DOCUMENTS CONSIDERES COMME PERTINENTS 



Categorie 



Identification des documents cites, avec, le cas echeant, 1 'indication des passages pertinents 



no. des re vendi cations visees 



EP 0 318 341 A (PLANT GENETIC SYSTEMS NV) 
31 Mai 1989 

voir le document en entier 

EP 0 2G8 418 A (LUBRIZOL GENETICS INC) 14 
Janvier 1987 

voir le document en entier 

WO 95 06128 A (DEKALB GENETICS CORP) ' 2 
Mars 1995 

voir page 211 - page 222 

PLANT MOLECULAR BIOLOGY, 

vol. 34, Mai 1997, 

pages 139-149, XPO0203474O 

TORRENT, M., ET AL. : "Lysine-rich 

modified gamma-zeins accumulate in protein 

bodies of transiently transformed maize 

endosperms" 

voir le document en entier 



1-41 



1-41 



41 



1-41 
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INTERNATIONAL SEARCH REPORT 

Information on patent family members 



International Application No 

PCT/FR 97/00167 



Patent document 
cited in search report 


Publication 
date 


Patent family 
member(s) 


Publication 
date 


wn Q315??1 A 


05-08-93 


jp 


7506482 T 


20-07-95 




C.*T UU 7J 


wo 


9312234 A 


24-06-93 


tr UjIOjHI M 


31-05-89 


CA 


1337048 A 


19-09-95 






AU 


634987 B 


11-03-93 






AU 


4495189 A 


01-05-90 






CA 


2000661 A 


14-04-90 






WO 


9004032 A 


19-04-90 






JP 


3502644 T 


20-06-91 






US 


5589615 A 


31-12-96 






AU 


2811889 A 


23-05-89 






WO 


8903887 A 


05-05-89 






EP 


0319353 A 


07-06-89 






JP 


2501802 T 


21-06-90 






US 


5487991 A 


30-01-96 






AT 


140959 T 


15-08-96 






DE 


3855455 D 


05-09-96 






DE 


3855455 T 


09-01-97 






AT 


143694 T 


15-10-96 






DE 


3855591 D 


07-11-96 






DE 


3855591 T 


03-04-97 






EP 


0723019 A 


24-07-96 






US 


5623067 A 


22-04-97 


tr U£UO**J.O M 


J.*T Wl O/ 


AU 


592090 B 


04-01-90 






AU 


5853686 A 


18-12-86 






CA 


1304025 A 


23-06-92 






DE 


3682233 A 


05-12-91 






JP 


62059300 A 


14-03-87 






US 


4885357 A 


05-12-89 






US 


4886878 A 


12-12-89 


WO 9506128 A 


02-03-95 


AU 


7716994 A 


21-03-95 \ 






CA 


2170260 A 


02-03-95 






EP 


0721509 A 


17-07-96 






HU 


74392 A 


30-12-96 






ZA 


9406488 A 


30-11-95 
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INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US93/00977 



C (Continuation). DOCUMENTS CONSIDERED TO BE RELEVANT 



Category 1 



Citation f document, with indication, where appropriate, of the relevant passage* 



Relevant to claim No 



Y 



Y 



Y 



Gene Analysis Techniques, Volume 4, issued 1987, C. B. Harley, 
"Hybridization of 01igo(dT) to RNA on Nitrocellulose", pages 17- 
22, see entire document. 

Methods in Enzymology, Vol. 65, Pt. 1, issued 1980, A. R. Dunn 
et al, "Mapping Viral mRNAs by Sandwich Hybridization", pages 
468-478, especially page 46, second full paragraph. 

DNA, Volume 7, No. 2, issued 1988, A. Palva et al, "Laboratory 
Methods: Quantification of a-Amylase mRNA in Bacillus subtil is 
by Nucleic Acid Sandwich Hybridization", pages 135-142, see 
entire document. 

US, A, 4,957,865 (Samarut et al) 18 September 1990, col. 11, 
lines 34-41. 

Journal of Molecular Evolution, Volume 19, Numbers 3-4, R. 
Nussinov, "Efficient algorithms for searching for exact repetition 
of nucleotide sequences.", pages 283-285, see Abstract 05061573. 

Computer Applications in the Biosciences, Volume 8, No. 3, 
issued September 1987, W. W. Ralph et al, "A modified Chou 
and Fasman protein structure algorithm", pages 211-216 see 
Abstract 06625157. 

Biochemical and Biophysical Research Communications, Volume 
170, No. 1, issued 1990, C. Hough et al, "Differential down- 
regulation of fi r and 02-adrenergic receptor mRNA in C 6 glioma 
cells", pages 46-52, see entire document. 

Analytical Biochemistry, Volume 196, issued 1991, A. Ballagi- 
Pordany et al, "Quantitative determination of mRNA phenotypes 
by the polymerase chain reaction", pages 89-94, see especially the 
Abstract. 

Journal of Biological Chemistry, Volume 265, No. 27, issued 25 
September 1990, B. Feve et al, "Differential regulation of 0 r and 
0 2 -adrenergic receptor protein and mRNA levels by 
glucocorticoids during 3T3-F442A adipose differentiation", pages 
16343-16349, see entire document. 



Journal of Biological Chemistry, Volume 266, No. 35, issued 15 
December 1991, P. Muzzin et al, "An adipose tissue- specific /S- 
adrenergic receptor", pages 24053-24058, especially Figures 1 and 
2 • 



1-28, 33-40 



1-3. 6 



4, 5, 7-28, 33-40 



1. 2. v 



4, 5, 7-28, 33-40 



29-31 
32 

23 



23-25 



9, 33, 34 



1-3 



9, 33, 34 



9, 33, 34 
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C (Continuation). IX)CUMENTS CONSIDERED TO BE RELEVANT 


Category 0 


Citation of document, with indication, where appropriate, of the relevant paaaagea 


Relevant to claim No. 


Y 


Analytical Biochemistry, Volume 170, issued 1988, C. Albretsen 

©X ai, ijpurnai conuiuuns iur nyuriui^iijijii wiui uiigijiiuvjc^juuwa. 
A study with myc-oncogene DNA probes", pages 193-202, 
especially paragraph bridging pages 193 and 194. 


1-28 


Y 


Proceedings of the National Academy of Science USA, Volume 

O^f, 1SSU6Q XVuircn / 1 IVl. O. VVaiCI IlldJl CI dl, X^llaaC LldJlalUUIlS 

in sequence matches and nucleic acid structure", pages 1239-1243, 
see entire document. 


1-28 


Y 


Journal of Molecular Biology, Volume 197, issued 1987, M. S. 
waterman ei ai, /\ new ajgonuirn ror ocot ^uuscijuciii^c 
alignments with application to tRNA-rRNA comparisons", pages 
723-728, see entire document. 


1-23 


Y 


Biotechniques, Volume 12, No. 2, issued Feb 1992, R. J. Cano et 
ai, jjxna nyDnaizauon assay using /\i 1 \jrn.\j& , a iiuurcsccni 
substrate for alkaline phosphatase", pages 264-269, see entire 
document. 


1, 2, 18, 21, 22 


Y 


Genomics, Vol. 11, issued 1991, W. R. Pearson, "Searching 

nmtAin cAnn^nrp lih-rari^c* frtftmsiTi enn th^ Ca*n ti vi fv anH 
piwldll □dfUdlwG il U I <H Ivj. v_*uui}xu idftju vii uiw awiidiu.vii.jr cuiu 

selectivity of the Smith-Waterman and FASTA algorithms", pages 
635-650, see entire document. 


1-23 


v 

X 


Rirw^he»micrrv Vnl ^0 iccii^i 1QQ1 W p derarri "Human 

substance P receptor (NK-1): Organization of the gene, 
chromosome localization, and functional expression of cDNA 
clones", pages 10640-10646, especially Figure 1. 


12 39 40 


Y 
i 


1 I1C dXVLDV^f JUUilUU, W1UI11C O, 1^1 J, 1WUCU 1707, 0 . 1. Jill CLX CI 

al, "Characterization of junD: a new member of the jun proto- 
oncogene family", pages 1433-1439, especially Figure lb. 


35 36 


Y 


Journal of Biological Chemistry, Volume 262, No. 29, issued 15 
October 1987, D. T. Jones et al, "Molecular cloning of five GTP- 
binding protein cDNA species from rat olfactory 
neuroepithelium", pages 14241-14249, especially Figures 2-6. 


10, 11, 13-15, 
37-38 


Y 


Journal of Biological Chemistry, Volume 265, No. 3, issued 25 
January 1990, L. Li et al, "Different members of the jjm proto- 
oncogene family exhibit distinct patterns of expression in response 
to type 0 transforming growth factor", pages 1556-1562, 
especially Figure 5. 


35, 36 
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C (Continuation). DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 



Citation of document, with indication , where appropriate, f the relevant paaaagea 



Relevant to claim No. 



Journal of Biological Chemistry, Volume 264, No. 30, issued 25 
October 1989, Y. Yokota et al, "Molecular characterization of a 
functional cDNA for rat substance P receptor", pages 17649- 
17652, especially Figure 2. 

Journal of Biological Chemistry, Volume 263, No. 14, issued 15 
May 1988, J. Codina et al, "a r 3 cDNA encodes the a subunit of 
G k , the stimulatory G protein of receptor-regulated K + channels", 
pages 6746-6750, especially Figure 2. 

Journal of Biological Chemistry, Volume 263, No. 14, issued 15 
May 1988, H. Itoh et al, "Presence of three distinct molecular 
species of G; protein a subunit", pages 6656-6664, especially 
Figures 2, 4-6, and 8. 

Proceedings of the National Academy of Science USA, Volume 
85, issued December 1988, K. Hattori et al, "Structure and 
chromosomal localization of the functional intronless human JUN 
protooncogene", pages 9148-9152, especially Figure 5. 

Proceedings of the National Academy of Science USA, Volume 
84, issued August 1987, P. Bray et al, "Human cDNA clones for 
an a subunit of G { signal-transduction protein", pages 51 15-51 19, 
especially Figure 2 and 5-7. 

Proceedings of the National Academy of Science USA, Volume 
84, issued November 1987, C. R. Beals et al, "A small multigene 
family encodes G l signal-transduction proteins", pages 7886-7890, 
especially Figures 1,2, and 4. 

Nucleic Acids Research, Volume 16, No. 8, issued 1988, B. A. 
Harris, "Complete cDNA sequence of a human stimulatory GTP- 
binding protein alpha subunit", page 3585. 

FEBS Letters, Volume 206, No. 1, issued September 1986, R. 
Mattera et al, "Identification by molecular cloning of two forms of 
the a-subunit of the human liver stimulatory (GJ regulatory 
component of adenylyl cyclase", pages 36-42, especially Figure 2. 

FEBS Letters, Volume 219, No. 1, issued July 1987, J. R. 
Didsbury et al, "Molecular cloning of a new human G protein", 
pages 259-263, especially Figures 1 and 2. 

Nucleic Acids Research, Volume 19, No. 17, issued 1991, A. 
Swaroop et al, "Differential expression of novel Gsa signal 
transduction protein cDNA species", pages 4725-4729, see entire 



12, 39, 40 



13, 37, 38 



10, 11, 13-15, 
37, 38 



35, 36 



10, 11, 13-15, 
37, 38 



10, 11, 13-15, 
37, 38 



10, 11, 13-15, 
37, 38 



10, 11, 13-15, 
37, 38 



10, 11, 13-15, 
37, 38 



10, 11, 13-15, 
37, 38 
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B FIELDS SEARCHED 

Electronic data bases consulted (Name of data base and where practicable terms used): 

N-CENESEQ, GEN BANK, GENBANK-NEW. UEMBL, EMBL-NEW, CA, MEDLINE. BIOSIS, APS search terms 
Waterman, Pevzner, second hybridization strength, continuous seed filtration, miUuhashi probe selection, calculation of 
melting temperature of nucleic acids, H-site model, nucleation threshold, mismatch model, measuring or assaying beta 
receptor mRNA, measuring or assaying substance P mRNA, measuring or assaying jun mRNA, measuring or assay in c 
Gs, Go, or Gi protein mRNA 7 8 

BOX II. OBSERVATIONS WHERE UNITY OF INVENTION WAS LACKING 
This ISA found multiple inventions as follows: 

1. Claima 1-9, 16-28, 33 and 34, drawn to a method for detecting and quantifying specific mRNAi and a 

polynucleotide immobilized support for use in said method which detects preceptor mRNA. classified in Class 435/6 
for example. 

H. Claim 10, drawn to a method for detecting and quantifying specific Gs protein mRNA, classified in Class 

435/6, for example. 

III. Claim II. drawn to a method for detecting and quantifying specific Gi-2 protein mRNA, classified in Class 
435/6, for example. 

IV Claims 12, 39, and 40, drawn to a method for detecting and quantifying ipecific substance P mRNA and 

polynucleotide support for accomplishing the method, classified in Class 435/6, for example. 

, CUim 13 ' drmWn to 1 mcthod for <*<**tin«" quantifying specific Gi-3 protein mRNA. classified in Class 
435/6, for example. 

yjc, , CUim U * dfmWn 10 * for detecting and quantifying specific Gi-1 protein mRNA, classified in Class 

435/6, for example. 

VII. Claims 15, 37, and 38, drawn to a method for detecting and quantifying specific G protein mRNA and 
polynucleotide immobilized support for accomplishing the method, classified in Class 435/6 for example 

VIII. Claims 35 and 36. drawn to a polynucleotide immobilized support for the detection and quantification of iun- 
oncogene mRNA, classified in Class 536/24.3. 

IX. Claims 29-32, drawn to a method of inhibiting RNase activity in a sample, classified in Class 435/19. 
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Box I Observations where cerUin claims were found unsearchable (Continuation of item 1 of first sheet) 



This interrotional report ho not been established in respect of certain claims under Article 17(2X«) for the following reasons: 
1. I 1 Claims Nos.: 

' because they relate to subject matter not required to be searched by this Authority, namely: 



2. ["1 Claims Nos.: 

*— ' because they relate to parts of the international application that do not comply with the prescribed requirements to such 
an extent that no meaningful international search can be carried out, specifically: 



3. | | Claims Nos.: 

because they are d e p e nd ent claims and are not drafted in accordance with the second and third sentences of Rule 6.4<s). -., 



Box II Obaerratioas where unity of usTeotno is lacking (Continuation of stem 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 

(Telephone practice) 
Please See Extra Sheet. 



1 . I x| ^ *U required additional search fees were timely paid by the applicant, this international search report covers all searchable 

claims. 

2. | | As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 

of any additional fee. 

3. | | As only some of the required additional search fees were timely paid by the applicant, this international search report covers 

only those claims for which fees were paid, specifically claims Nos.: 



4. | | No required additional search fees were timely paid by the applicant. Consequently, this international search report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



Remark n Protest | | The additional search fees were accompanied by the applicant's protcstr 

| | No protest accompanied the payment f additional search fees. 
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INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US93/00977 



Box I Obserratkms where certain claims were found unsearchable (Continuation f item 1 f first sheet) 



This international report has not been established in respect of certain claims under Article 17(2Xa) for the following reasons: 



□ 



Claims Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 



□ 



Claims Nos.: 

becatuethey relate to parts of the international application that do not comply with the prescribed requirements to such 
an extent that no meaningful international search can be carried out, specifically: 



3. Q Claims Nos.: 

becauscthey are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 
I Box II Qbserrations where unity of invention is lacking (Continu ation of item 2 of first sheet) 
| This International Searching Authority found multiple inventions in this international application, as follows: 

(Telephone practice) 
Please See Extra Sheet. 



1. [x] As all required additional search fees were timely paid by the appUcant. this international search 



claims 



report covers all searchable 



2. 



' oY^y^Snal fee™ ^ ^ ***** ******* ™ additional fcc > *** Authority did not invil 



invite payment 



^ i°£SS f ,dditi0Ml fCCS WenS ^ Paid ^ »PP Ucant - National search report cover, 

only those claims for which fees were paid, specifically claims Nos.: ^ 



4 ' ^ ^ ^ ^ by appUcant CQ ™^™«y> this international search report is 

restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



Remark n Protest 



Q The additional search fees were accompanied by the applicant's protest" 

□ 

No protest accompanied the payment of additional search fees. 
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CORRECTED 
VERSION * 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
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METHOD FOR MEASURING MESSENGER RNA 
Backgro und of the Invention 

Field of the Invention 

This invention is related to a method for measuring messenger RNA (mRNA) and 
5 polynucleotide-immobilized supports useful in this method. Specifically, this invention is 

related to measuring methods and reagents in which various types of mRNA can be 
measured at the same time. In one embodiment of this invention, the mRNA need not be 
purified from the cell, and the mRNA can be rapidly quantified with high sensitivity. 
Description of Previo us Techniques 

Previously well-known techniques for measuring mRNA levels include the Northern 
Blot Method (See, e.g., J. Sambrook, et al.. Molecular Cloning, 2nd ed., Cold spring Harbor 
Laboratory Press, 1989 hereafter referred to as "Molecular Cloning"), the Dot Blot Method 
(See, e.g.. Molecular Cloning), Ribonuclease Protection Assay (See, e.g., Molecular Cloning), 
and the Reverse PCR Method (HA. Erlich (ed.), PCR Technology-Principles and 
15 Applications for DNA Amplification, Stockton Press, New York, NY, 1989). The disclosures 

of all of these previous references are hereby incorporated by reference. 

The Northern Blot Method detects mRNA which is separated by molecular weight 
using electrophoresis. The separated molecules are immobilized on a membrane and 
mRNA levels are determined by hybridization of marker DNA probes. With this method, 
20 the molecular weight of the subject mRNA must generally be known. By using this method 

a complex between a labeled probe, mRNA and an insoluble membrane is formed. The 
amount of label in the complex at the appropriate site on the membrane is then measured 
coiorimetrically or by radioactive emissions. 

The Dot Blot Method detects mRNA by hybridization with a labeled probe after 
25 purified mRNA is immobilized as a spot on a membrane. This method can be used to 

rapidly examine many samples at one time. However, both the Dot Blot and the Northern 
Blot methods require the use of a specific labeled probe for each mRNA which is desired 
to be detected. 

The Ribonuclease Protection Assay method is used to detect mRNA after it is 
30 initially purified. The purified mRNA is hybridized with a labeled RNA probe, and then 

treated with a ribonuclease that specifically digests single stranded RNA. The labeled RNA 
probe will form a duplex in the solution and be protected from digestion. This is then 
immobilized on a membrane after separation of the double-stranded RNA/probe complex. 
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The reverse PCR method is used where cDNA is synthesized using reverse 
transcriptase and purified mRNA. This cDNA can be immobilized to a membrane. The 
synthesized cDNA is then amplified using standard PCR (Polymerase Chain Reaction) 
methods. After separating cDNA according to its moiecuiar weight by electrophoresis, it 
becomes a single band. The DNA is then immobilized on a membrane, and detected by 
hybridization with a labeled DNA probe. However, after PCR, no quantitative information 
remains due to the amplification of the original message. 

All of the foregoing methods take extended periods of time to perform since the 
mRNA used must be in at least a partially purified form. Also, electrophoresis of the 
samples and immobilization to a membrane is necessary. Both of these steps reduce the 
reliability of quantification of the mRNA. 

Thus, there remains a need to resolve the above problems, and to provide a method 
of measuring for rapidly determining messenger RNA levels and a measuring reagent useful 
in such a method. 
15 Summary of the Invention 

In the mRNA assay methods of the present invention, a first and a second 
polynucleotide probe can be used. The inventors have discovered that immobilization of 
probes (a first polynucleotide) rather than samples, on a water-insoluble support can provide 
a basis for the rapid quantification of multiple unpurified samples containing mRNA. 
20 Applicants have provided an efficient method of identifying appropriate specific probes for 

this purpose. In preferred forms of the present invention, the first polynucleotide probe has 
a nucleotide sequence complementary to a region which is unique in the sample to the 
mRNA and does not bind other portions in the target mRNA. The inventors also 
discovered that a single kind of second polynucleotide probe can be used for simultaneous 
25 quantification of various mRNAs. However, any of a variety of probes that bind to a 

different region of the mRNA than the first probe can be used for this purpose. In this 
regard, the inventors have found that a second polynucleotide probe containing sequences 
complementary to a polyadenylic acid tail of mRNA can be used in such a simultaneous 
assay method. Based on these and other discoveries, the inventors established the present 
30 invention. 

One embodiment of the present invention is a highly sensitive, quantitative and rapid 
method for detecting and quantifying mRNA in a sample without the need to purify mRNA 
from cells. In this embodiment of the invention, the method comprises the following steps: 
(a) identifying a polynucleotide sequence that is unique to the mRNA; 
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(b) immobilizing a first polynucleotide to an insoluble support, the first 
polynucleotide having a first sequence complementary to the sequence unique to the 
mRNA; 

(c) incubating the sample with the insoluble support under conditions 
5 wherein the unique sequence will hybridize with the first polynucleotide, thereby 

immobilizing mRNA present in the sample to the insoluble support; 

(d) washing non-immobilized components of the sample from the 
insoluble support; 

(e) labeling mRNA on the support in a manner that label is incorporated 
10 onto the support related to the amount of mRNA on the support; and 

(f) measuring the amount of label immobilized on the support. 

In a preferred embodiment, the above step of labeling mRNA on the support involves 
incubating a labeled probe, most preferably poly-d(T), with the insoluble support under 
conditions wherein the labeled probe will hybridize with mRNA on the support, and the 

15 labeled probe bears a label and is complementary to a part of the mRNA other than a part 

of the mRNA complementary to the first probe, thereby immobilizing to the insoluble 
support the label on the labeled probe which has hybridized with the mRNA; and thereafter 
washing any non-immobilized labeled probe from the insoluble support. 

In addition, the above method of labeling nucleic acids in step (e) can preferably 

20 involve labeling the oligonucleotides with a nucleic acid stain, most preferably from the 

group consisting of ethidium bromide, yoyo-1 and toto-L 

In a further preferred embodiment, the above method can utilize a label, wherein 
the label comprises a radionuclide, or alternatively biotin. In the method using biotin, the 
incubation can preferably be with enzyme-conjugated streptavidin; followed by measurement 

25 of the amount of enzyme bound to the insoluble support. In this embodiment step (f) can 

additionally comprise adding streptavidin bound to alkaline phosphatase to the insoluble 
support. 

In still another embodiment of the preferred method, the mRNA can code for a p 
receptor, and wherein the first polynucleotide comprises the sequence of SEQ ID NO:6. 
30 An alternative embodiment of the above method includes mRNA coding for the a subunit 

of Gs protein, wherein the first polynucleotide comprises a sequence complementary to a 
sequence selected from the group consisting of SEQ ID NO:l, 145-146. 209-212. 268-287, 
705 and 708. 
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Yet another alternative mRNA sequence that can be used with the present invention 
method codes for the Gi-2 protein, wherein the first polynucleotide comprises a sequence 
complementary to a sequence selected from the group consisting of SEQ ID NO:703, 806, 
142, 143, 187-197 and 238-253. 

The present method can also preferably involve the use mRNA coding for substance P 
receptor, wherein the first polynucleotide comprises the sequence of SEQ ID NO:5. 

mRNA encoding Gi-3 protein can also be used in the present method, wherein the 
first polynucleotide comprises a sequence complementary to a sequence selected from the 
group consisting of SEQ ID NO:704, 707, 144, 198-208 and 254-267. 

Still another embodiment of the above method can involve the mRNA coding for 
the Gi-1 protein, wherein the first polynucleotide comprises a sequence complementary to 
a sequence selected from the group consisting of SEQ ID NO:702, 805, 141, and 224-237. 
An even further preferred embodiment includes mRNA coding for G protein, wherein the 
first polynucleotide comprises a sequence complementary to a sequence selected from the 
15 group consisting of SEQ ID NO:70a 701, 139, 140, 148-159, 162-172, 176-186 and 3-4. 

Most preferably, the first sequence hybridizes to a unique sequence of the mRNA 
and not to other polynucleotides present in the sample, and even more preferably, the 
sample comprises a cell lysate. 

In an alternative preferred embodiment, the label in the above method can be 
20 measured by light emitted therefrom, and step (f) of the method comprises measuring the 

amount of light emitted by the label. The method of measuring the light can preferably 
involve; recording the amount of light on film; and measuring the exposure of this film using 
a densitometer. In addition, the light measurement can alternatively comprise adding 
ATTOPHOS and measuring fluorescence emitted using a fluorimeter. 
25 The present invention method in step (a) above can advantageously include a 

computer program, with the computer program preferably involving use of an H-site model. 
The H-site model of step (a) is preferably used to identify the first polynucleotide probe of 
the method by the steps of: 

specifying a minimum melting temperature for the first nucleotide 
probe and the nucleotide sequence specific to the organism; 

specifying a nucleation threshold that places a minimum value on the 
number of base pairs at any nucleation site; 
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determining the melting temperatures (T m ) of the first nucleotide 
probe and the sequence specific to the organism at every possible 
hybridization point; and 

selecting the nucleotide probe having the highest T m value. 
5 The computerized method of identifying the first polynucleotide probe can preferably 

have the melting temperature determined by the formula; 

T m = 81.5-16.6(log[Na])-0.63%(formamide) + 0.41(%(G + C))-600/N, wherein Log[Na] is the 
log function of the sodium concentration, 0.063%(formamide) is the concentration of 
formamide, %(G + C) is the percentage of matched GC base pairs, and N is the probe 
10 length. 

Additionally the method of step (a) can be used to identify the labeled probe 
referred to above using the H-site computerized model by the steps of: 

specifying a minimum melting temperature for the labeled probe and 
the nucleotide sequence specific to the organism; 
15 specifying a nucleation threshold that places a minimum value on the ::: 

number of base pairs at any nucleation site; 

determining the melting temperatures (T m ) of the first nucleotide 
probe and the sequence specific to the organism at every possible 
hybridization point; and 
20 selecting the nucleotide probe of the proper length having the lowest 

T m value. 

A method of identifying the second nucleotide probe can involve determining the 
melting temperature by the formula: 

T m = 81.5-16.6(log[Na])-0.63%(formamide)4-0.41(%(G + C))-600/N. wherein LogfNa] is the 
25 log function of the sodium concentration, 0.063%(formamide) is the concentration of 

formamide, %(G + C) is the percentage of matched GC base pairs, and N is the probe 
length. 

A different embodiment of the present invention is a method of inhibiting RNase 
activity in a sample containing RNA in a buffer system containing SDS and EDTA. 
30 comprising adding vanadyl ribonucleoside complex to the sample. 

Preferably, there is at least a five-fold molar excess of EDTA over RNA in the sample and 
most preferably, the vanadyl ribonucleoside complex is added in combination with proteinase 
K. In an alternative preferred embodiment, the vanadyl ribonucleoside complex is added 
in combination with RNasin. 
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Yet another different embodiment of the present invention is a polynucleotide- 
immobUized support that is useful in detecting and quantifying the amount of p receptor 
mRNA in a sample, comprising: 

an insoluble support; 

a first polynucleotide bound to the support, the first polynucleotide 
comprising a sequence that is complementary to a sequence unique to p receptor 
mRNA. 

More preferably, the sequence unique to p receptor mRNA comprises a sequence selected 
from the group consisting of SEQ ID NO:6. 

StilJ another preferred embodiment of the present invention is a polynucleotide- 
immobiiized support that is useful in detecting and quantifying the amount of jun oncogene 
mRNA in a sample, comprising: 

an insoluble support; 

a first polynucleotide bound to the support, the first polynucleotide 
comprising a sequence that is complementary to a sequence unique to jun oncogene 
mRNA. 

Preferably, the sequence unique to the jun oncogene mRNA comprises a sequence selected 
from the group consisting of SEQ ID NO.11-35, 39-53. 62-138. 310-342, 600-606, 614-620, 
and 628-634. 

Another embodiment of the present invention is a polynucieotide-immobilized 
support that is useful in detecting and quantifying the amount of G protein mRNA in a 
sample, comprising: 

an insoluble support; 

a first polynucleotide bound to the support, the first polynucleotide 
comprising a sequence that is complementary to a sequence unique to G protein 
mRNA. 

Preferably, the sequence unique to G protein mRNA comprises a sequence selected from 
the group consisting of 1, 2. 700-709, 805, 806. 139-159. 162-172 and 176-309. 

Another preferred embodiment of the present invention is a polynucleotide- 
immobilked support that is useful in detecting and quantifying the amount of Substance P 
receptor mRNA in a sample, comprising: 
an insoluble support; 



SUBSTITUTE SNIIET 



WO 93/15221 PCT/US93/00977 

-7- 

a first polynucleotide bound to the support, the first polynucleotide 
comprising a sequence that is complementary to a sequence unique to Substance P 
receptor mRNA. 

Preferably the sequence unique to Substance P receptor mRNA comprises the sequence of 
5 SEO ID NO:5. 

Brief Description of the Figures 
Figure 1 illustrates an immobilization procedure of a first polynucleotide probe onto 
an insoluble support by the maleimide method. 

Figure 2 illustrates an immobilization procedure of a first nucleotide probe onto an 
10 insoluble support by the carbodiimide method. 

Figure 3 schematically shows the principle of an mRNA quantification method in 
accordance with the present invention. 

Figure 4A shows a photographic image of an X-ray film exposed to 
chemiluminescent light derived from the human p 2 adrenergic receptor cDNA probe on a 
15 Northern membrane, in which various concentrations of the human P-> adrenergic receptor 

mRNA were immobilized. 4B shows a photographic image of an X-ray film exposed to 
chemiluminescent light derived from the oligo-(dT) probe in accordance with the present 
invention in which various concentrations of the human P 2 adrenergic receptor mRNA were 
trapped onto the first oligonucleotide-immobilized plastic plate. 4C shows a densitogram 
20 of the image shown in 4B. 4D graphically depicts the relationship between relative intensity 

of chemiluminescent signals and concentrations of mRNA of the image shown' , in 4B. 

Figure 5 graphically depicts the linear relationship between relative intensity of 
chemiluminescent signals and concentrations of mRNA in a separate experiment from that 
shown in Figures 4B-D. 

25 Figure 6 graphically depicts a calibration line for the rat Gs protein mRNA of the 

invented method. 

Figure 7 shows a photographic image of a polaroid film exposed to chemiluminescent 
light derived from oligo-(dT) probe by the invented method, in which Gi-2 and GS protein 
mRNA in various cells were trapped onto the first polynucleotide-immobilized plastic plate. 
30 Figure 8 shows a photographic image of an X-ray film exposed to chemiluminescent 

light derived from the Gi-2 and Gs protein cDNA probes on the Southern membrane, in 
which PCR-amplif led G protein specific DNA from 4 different cell lines was immobilized. 

Figure 9 graphically depicts a calibration line for the human substance P receptor 
mRNA of the invented method. 
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Figure 10 shows a gel showing the effect of various RNase inhibitors on mRNA 
preparations containing SDS and EDTA. 

Figure 11 shows a gel of five different G protein oligonucleotides amplified with the 
G 2 and G 4 PCR primers and also provides Southern blots using each of the five G protein 
5 sequences as a probe. 

Figure 12 shows a gel of samples containing varying amounts of each of five different 
G protein oligonucleotides (as indicated by number of " + " symbols) amplified with the G 2 
and G 4 PCR primers and also provides Southern blots using each of the five G protein 
sequences as a probe. 

10 Figure 13 shows a gel of XZAP cDNA libraries from rate pituitary (P), kidney (K) 

and intestinatal (I) amplified with each of five different G protein primers, as indicated. 

Figure 14 shows a gel of 500 bp DNA from cDNAs of human IM9 and Jurkat cells 

amplified with G 2 and G 4 PCR primers. 

Figure 15 is a graph showing the relationship between YOYO-1 fluorescense and 
15 amount of immobilized oligonucleotide. 

Figure 16 is a graph showing the time course of YOYO-1 analysis- 
Figure 17 is a graph showing the dose response of YOYO-1 concentration. 
Figure 18 is a bar graph showing the reproducibility of YOYO-1 staining based on 

quantitation of immobilized oligonucleotides. 
20 Figure 19 exhibits a Polaroid picture of agarose gel electrophoresis of the PCR 

amplified ds-cDNA of jun oncogene from human peripheral blood leukocytes. 

Figure 20 is a simplified block diagram of a computer system illustrating the overall 

design of this invention; 

Figures 21A-21C show display screen representations of the main oligoprobe design 
25 station dialog windows of this invention; 

Figures 22A and 22B are flow charts of the overall invention illustrating the program 

and the invention's sequence and structure; 

Figure 23 is a display screen representation of the Mitsuhashi probe selection 

diagram; 

30 Figure 24 is a display screen representation of the probeinfo and matchinfo window; 

Figure 25 is a display screen representation of the probesedit window; 
Figures 26A and 26B are printouts of the probesedit output file; 
Figure 27 is a flow chart of the overaLI k^diff program of the Mismatch Model of this 
invention, including its sequence and structure; 
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Figures 28A and 28B are flow charts of the k diff module of this invention; 
Figure 29 is a flow chart of the hashing module of this invention; 
Figure 30 is a flow chart of the tran module of this invention; 
Figure 31 is a flow chart of the let_dig module of this invention; 
5 Figure 32 is a flow chart of the update module of this invention; 

Figures 33A and 33B are flow charts of the assembly module of this invention; 
Figures 34A and 34B are flow charts of the seqload module of this invention; 
Figures 35A and 35B are flow charts of the readl module of this invention; 
Figure 36 is a flow chart of the digjet module of this invention; 
Figures 37A and 37B are flow charts of the q_colour module of this invention; 
Figure 38 is a flow chart of the hit_ext module of this invention; 
Figure 39 is a flow chart of the colour module of this invention; 
Figure 40 is the first page of a printout of a sample file containing the output of the 
Mismatch Model program of this invention; 
15 Figure 41 is a flow chart of the H-Site Model stage I, covering the creation of a 

preprocessed preparation file of this invention; 

Figure 42 is a flow chart of the H-Site Model, stage II, covering the preparation of 
the target sequence(s); 

Figures 43A-43D are flow charts of the H-Site Model, stage III, covering the 
20 calculation of MPSD data; 

Figure 44A is the first page of a printout of a sample file containing output ;.of 
Mismatch Model program; 

Figure 44B is the first page of a printout of a sample file containing output of H-Site 
Model program; 

25 Figure 45 is a flow chart of the processing used to create the Mitsuhashi probe 

selection diagram (MPSD); 

Figure 46 is a flow chart of processing used to create the matchinfo window; 
Figure 47 is a printout of a sample target species file; 
Figures 48A-48C are printouts of a sample preparation file. 
30 Detailed Description of the Invention 

Target mRNA 

Previously, in order to genetically diagnose an infectious disease, procedures utilizing 
DNA were generally performed. Although, with DNA diagnosis, the existence of an external 
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gene can be confirmed, it cannot be determined if the known gene is in an unexpressed 
carrier state or expressed in the diseased state. 

In living organisms, once a gene is activated, the DNA coding for that gene is 
transcribed into mRNA, and the mRNA is subsequently translated into protein. Thus, the 
5 activation of the external gene can be determined by measuring the levels of mRNA. 

Currently, to genetically diagnose a protein deficiency state, most experimenters have 
limited themselves to analysis at the DNA level. However, this method cannot be used to 
determine abnormalities of transcription or translation. If mRNA levels could be measured 
easily, the diseased state might be more precisely recognized. In a diseased state where 
10 mRNA cannot be detected, it can be said that there is a defect in transcription, while if 

normal levels of mRNA are detected, there is probably an abnormality in translation. 

In addition, the amount of mRNA produced in the cell reflects the amount of 
expression of a gene, so it would be useful in the course of an abnormal condition, such as 
an illness or drug reaction, to examine mRNA levels. As discussed in greater detail below, 
15 measuring any of a large number of mRNAs can provide clinically useful information. 

Particular examples include mRNA for the p receptor, substance P, G protein and 
oncogenes, such as jun. 

For the aforementioned reasons, the ability to measure mRNA would be useful for 
diagnosing and recognizing the pathophysiology of various disease states. This method 
20 would be especially useful for diagnosis of hereditary diseases, cancer, and infectious 

diseases. 

A preferred target of this invention is mRNA of eukaryote cells which contain 
polyadenylic acid tail on their 3'-terminal end. Once the specific nucleotide sequence of the 
mRNA is known, virtually any mRNA can be measured using the methods of the present 
25 invention. Examples of such mRNAs are human G protein, human p 2 receptor, substance 

P, lymphocyte surface antigen, immunoglobulin, collagen, and adrenaline receptor. 
Test Samples 

Test samples containing target mRNA can be obtained from a variety of sources. 
Preferred sources include living samples or cell solutions which have been treated to 
30 inactivate ribonuclease (RNase) activity. Purification of mRNA for standard samples can 

be conducted according to the methods usually associated with Northern Blot procedures 
or Dot Blot procedures method. Kits for the purification of mRNA are commercially 
available. 
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A test sample can be prepared from a living sample by treatment with 10 mM of 
ethylenediaminetetraacetic acid (EDTA) (pH 8.0). 0.2 M NaCl, 0.5% of sodium dodecyl 
sulfate (SDS), 500 Unit/ml of RNase inhibitor, lOmM of Vanadyl Ribonucleoside Complex 
and 200 ng/1 of Proteinase K (hereafter called Lysis buffer) to lyse the cells and inhibit 
5 RNase activity. After lysis of the cells, NaCl concentration is adjusted to 0.5M. 

In the methods of the present invention that involve probing samples containing 
RNA, or in any procedure in which it is desired to prevent the degradation of RNA. it is 
advantageous to inhibit the activity of any ribonucleases (RNases) which may be present. 
One Rnase inhibitor which can be used is Vanadyl Ribonucleoside Complex (VRC) 
10 (Bethesda Research Laboratories, Gaithersburg, MD). VRC has been reported to be useful 

during cell fractionation and in the preparation of RNA, and has been shown not to 
interfere with the phenol extraction or ethanol precipitation of RNA. In addition. VRC 
does not affect other cytoplasmic components of cells. Therefore, VRC is an ideal inhibitor 
of RNase in many experimental procedures. 
15 However, prior art procedures for inhibiting RNases with VRC taught that VRC 

should not be used in buffer systems containing EDTA or SDS, which are commonly used 
in the field of molecular biology. The reason for this prohibition was that it was believed 
that the Complex would dissociate in the presence of these buffers, leading to a loss of 
RNase inhibiting activity. In fact, the BRL insert accompanying the VRC product 
20 recommends that a five- to ten-fold molar excess of EDTA be added to an RNA solution 

containing VRC in order "destroy" the VRC prior to ethanol extraction of the RNA from 
the solution. The apparent inability to use VRC together with common buffers that include 
EDTA and SDS thus presented a major impediment to the exploitation of VRC as an 
RNase inhibitor. 

25 We have discovered, however, that VRC is in fact an effective RNase inhibitor even 

in the presence of SDS and/or EDTA. Thus, VRC can be used in assays which use buffers 
including SDS or EDTA. where heretofore it was believed that VRC would not be effective 
in such systems. VRC is an effective RNase inhibitor at the concentrations of EDTA and 
SDS that are normally used when manipulating RNA or when performing a variety of other 

30 molecular biology techniques. For example, we have found that VRC effectively inhibits 

RNase in a buffer solution comprising approximately 1 mM EDTA. We also found VRC 
to be effective in .5% SDS solutions, and is believed to be effective in solutions ranging up 
to 2% SDS, more preferably up to \ c /n SDS. VRC can, of course, also be used in other 
solutions includine EDTA and SDS. 
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We have found that VRC is a particularly potent inhibitor of RNases when used in 
combination with Proteinase K. Proteinase K is also available from BRL. As shown in the 
gel in Figure 10, mRNA prepared from U937 cells (human macrophage cell line) was 
protected from RNase degradation by a combination of VRC and Proteinase K. Lane 1 of 
5 the gel shows the mRNA from a cell preparation which included VRC, Proteinase K, and 

RNasin, while lane 2 represents the mRNA from a cell preparation that included VRC and 
Proteinase K. RNasin is available from Promega of Madison, WI. The distinct band 10 in 
lanes 1 and 2 matches the band seen in lane 7, which contains pure clonal cDNA from an 
RNase-free preparation of U937 cells, thus showing that the mRNA in the preparations of 
lanes 1 and 2 did not experience substantial mRNA degradation. 

A comparison of lanes 1 and 2 with lanes 3-6 shows that VRC in combination with 
Proteinase K inhibits RNase activity in the above-mentioned mRNA preparation from U937 
cells to a far greater extent than either Protein K alone (lane 4), Proteinase K in 
combination with RNasin ( lane 3), or a commercial RNase-inhibiting preparation sold under 
15 the name "FastTrack" (available from In Vitrogen of San Diego, CA) (lane 5). None of 

lanes 3-5 exhibit the distinct band 10 representing unclegraded mRNA. On the contrary, 
lane 4 (Proteinase K alone) and lane 5 (FastTrack) have the same smear of degraded 
mRNA as band 20 in lane 6 (no inhibitors). The gel shown in Figure 10 also shows the 
effectiveness of VRC in a buffer solution of i mM EDTA and .5% SDS, which is the buffer 
20 used in the U937 cell preparations tested, since lane 2 (VRC and Proteinase K) shows less 

mRNA degradation than lane 4 (Proteinase K alone) or lane 3 (Proteinase K and RNasin). 
First Polynucleotide Probe 

In the practice of the present invention, a first polynucleotide probe is bound to an 
insoluble support. This first polynucleotide is used to trap the target mRNA to a solid 
25 support. Thus, the first polynucleotide probe comprises a nucleotide sequences which can 

hybridize with a sequence of the mRNA that does not occur in other RNA in the sample. 
Such a sequence is referred to herein as a unique region of the mRNA. Preferably, the first 
polynucleotide probe does not bind to portions of the target mRNA other than the unique 
region. 

A unique region can be identified by any of several methods. These include: (1) 
comparing the nucleotide sequence between several potential mRNAs and their target 
mRNAs; (2) getting a candidate sequence which can be compared with other nucleotide 
sequences; and (3) computer searching for nucleotide sequences which do not pair with the 
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target mRNA. Information on various known mRNA sequences can be obtained from 
several genetic information databases. 

The first polynucleotide probe is preferably an oligodeoxyribonucleotide (DNA) 
rather than an oligoribonucleotide (RNA) since DNA is more stable than RNA. The 
5 number of nucleotides is not restricted; however, if an oligodeoxyonucleotide is used as the 

first probe, a preferred length is 17-40 nucleotides. If a cDNA of target RNA is available, 
a part or the entire cDNA can be used as the first nucleotide probe. The first nucleotide 
probe can also be easily manufactured by using a DNA synthesizer or by a commercial 
manufacturer. 

10 In this invention, the first nucleotide probe used is immobilized to an insoluble 

support. It may be chemically modified for immobilization, such as by adding an amino 
group onto the 5'-terminal or 3*-terminal of the polynucleotide. 
Insoluble Support 

In the practice of the present invention, the first nucleotide probe is immobilized to 

15 an insoluble support. The insoluble support can be any of a variety of insoluble materials, 

such as a plastic plate, membrane filter, microliter plate or any other insoluble material to 
which polynucleotides can be attached. The insoluble support is preferably made of a 
material which can immobilize the first nucleotide probe by covalent bonds. For example, 
a plastic plate, such as polystyrene, which has a carboxyl group or amino group on the 

20 surface thereof can be used. Such plates are available commercially. However, procedures 

for introducing these residues are also well known in the art. One commercially available 
plastic plate which has carboxyl group, on its surface is Sumilon microtiter plate MS-3796F 
made by Sumitomo Bakelite. A plastic plate having an amino group on the surface is the 
Sumilon microtiter plate MS-3696F. 

25 Immobilization of First Polynucleotide 

In the practice of the present invention, the first polynucleotide probe is immobilized 
onto the insoluble support. Various methods of immobilizing polynucleotides to the 
insoluble support are available, including covalent binding, ionic binding, and the physical 
absorbance method. However, the covalent binding method is preferred. Thus, in certain 

30 embodiments of the present invention, the polynucleotides are immobilized to microtiter 

plates which exhibit functional groups, such as carboxyl residues, amine residues, or hydroxyl 
residues on the surface thereof. 

In a preferred procedure for immobilization of the polynucleotide to an insoluble 
support exhibiting a functional group, the 5*-terminal end of the polynucleotide is covalently 
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linked to the functional group. Any of a variety of methods for covalent binding of 
polynucleotides to these functional groups can be used. Examples of preferred well-known 
methods include the maleimide method and carbodiimide method. 

The maleimide method, illustrated in Figure 1, involves the reaction between a 
5 substance containing a maleimide group and another material containing a sulfhydryl residue 

(SH). In order to attach the 5' end of a polynucleotide to an immobilized support using the 
maleimide method, the 5' end of the polynucleotide is reacted with a maleimide compound. 
A suitable maleimide compound is sulfosuccinimidyl-4-(N-maleimidomethyl) cyclohexane-1- 
carboxylate (suIfo-SMCC). 
10 The SH residue is provided on the support by a reaction between a support having 

an amino group and succinimidyl-S-acetylthioacetate (SATA), followed by deacetylation 
using hydroxylamine (NH 2 OH). (Sulfo-SMCC and SATA are readily available from a 
variety of commercial sources, including the Pierce Company.) The resulting SH group on 
the support can then react with the maleimide group on the 5' end of the polynucleotide to 
15 form a polynucleotide-immobilized support. One problem we have experienced in 

the use of the maleimide method is that the SH group on the plate can react not only with 
an amino group at the 5' end of the polynucleotide, but can also react with primary amino 
groups on the purine bases, adenine and guanine. In order to assure that the 
polynucleotides are immobilized at their 5' ends, the amino groups on the purine bases can 
20 be protected by pairing the polynucleotide to a complementary polynucleotide prior to 

immobilization. After immobilization, the complementary polynucleotide can be removed 
through denaturation, such as through heating. 

The carbodiimide method is illustrated in Figure 2. This method involves a reaction 
between an amino group and a material containing a carboxyl residue using a carbodiimide 
25 compound. An example of a carbodiimide compound is l-ethyI-3-(3-dimethyIaminopropyl) 

carbodiimide hydrochloride (hereafter called EDC). This reaction can be enhanced with 
N-hydroxysulfosuccinimide (hereafter called Sulfo-NHS). Both EDC and Sulfo-NHS are 
available from well known commercial sources, including the Pierce Company. 

In the practice of a preferred carbodiimide method for attaching polynucleotides to 
30 a support, a support having a carboxyl residue attached is used. EDC is activated by 

reaction with Sulfo-NHS. This activated EDC is reacted with support containing surface- 
bound carboxyl residues. This can then be reacted with polynucleotides having an amino 
group at their 5'-terminaI ends, resulting in a polynucleotide-immobilized support. 
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In order to assure that the polynucleotides are immobilized at their 5 - end, primary 
amino groups on adenyl, guanyl and cytosyl groups can be protected by hybridizing the 
nucleotide to a complementary polynucleotide prior to immobilization. After 
immobilization, the complementary polynucleotide can be removed through denaturation, 
5 such as through heating. 

We have found that non-specific binding of activated amino or carboxyl residues on 
insoluble supports can be effectively reduced or eliminated by treating the plates to which 
polynucleotides have been immobilized with a primary amine compound, preferably glycine. 
Second Polynucleotide Probe 
10 The second nucleotide probe contains sequences which can also hybridize to target 

mRNA that is trapped by the first bound nucleotide sequence. Preferably, the second 
polynucleotide probe hybridizes to a different portion of the target mRNA than does the 
first polynucleotide probe. In an especially preferred embodiment, the second nucleotide 
probe consists of nucleotides which are complementary to the polyadenylic acid.tail of the 
15 target mRNA. The second nucleotide probe is also preferably DNA rather than RNA due 

to the higher stability of DNA. The number of nucleotides is not limited; however, a 
polynucleotide of 15-40 nucleotides is preferable. Thus a poly (dT) sequence of 15-40 
nucleotides is a preferred second polynucleotide probe. Such a second polynucleotide probe 
would serve as a universal second probe for virtually any target mRNA from eukaryotic 
cells. However, if a cDNA to the target RNA is available, a part or all of this cDNA can 
also function as a second polynucleotide probe. The second nucleotide probe can also be 
easily manufactured by a DNA synthesizer or commercial manufacturer. 

The second polynucleotide probe is preferably labeled in order to easily detect its 
presence. A variety of chemical substances are available which can label the target 
25 substance. For example, a variety of radionuclides can be used, such as the radioisotopes 

32 P, 35 S, 3 H, and 125 I. Enzymes can also be used to label the second polynucleotide probes. 
Suitable enzymes include alkaline phosphatase, luciferase. and peroxidase. Other labels 
might be chemical compounds such as biotin, avidin. streptavidin, and digoxigenin. Labels 
which provide a colorimetric indication or a radionuclide are preferred. In a preferred 
30 embodiment, biotin is attached to the nucleotide probe, followed by an avidin, such as 

streptavidin, conjugated to an enzyme such as alkaline phosphatase. The presence of the 
enzyme can be detected by various substrates, such as ATTOPHOS which provides a 
flourescent marker that can be detected throuch fluorimetrv. 
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In certain embodiments, part or all of the labeling compound is substituted with part 
or all of the second nucleotide probe. Techniques for substituting portions of 
polynucleotides are well known for a variety of labels and kits are commercially available 
for many of these. For example, one can use a DNA 3'-end Labeling Kit or oligo- 
5 deoxythymidylic acid to label the probe. 

Nucleic acid can also be "labeled" by staining with a nucleic acid stain. Thus, where 
a relatively large amount of nucleic acid is present, ethidium bromid (EtBr) can be used to 
identify the presence of nucleic acid. This is useful within the context of the present 
invention, for example, if mRNA has hybridized to a short immobilized anti-sense probe 

10 then substantially more nucleic acid wiH be immobilized than if no mRNa had hybridized. 

However, more sensitive stains are more preferable. Such more sensitive stains include the 
various cyanine nucleic acid stains, such as POPO, BOBO, YOYO and TOTO available 
from Moecular Probes (California). These stains are described, e.g., in Science, 257:885 
( 1992). Particularly preferred stains for use in the context of the present invention include 

15 the shorter wavelength forms, TOTO-1 and YOYO-1, still more preferably YOYO-1. As 

little as four picograms of stained DNA can be detected by visible fluorescence upon 
stimulation with a transilluminator or hand-held UV lamp. Thus, these stains provide a 
particularly easy and sensitive method of identifying the presence of nucleic acids. 

We tested the ability of YOYO-1 to stain oligonucleotides immobilized to wells as 

20 discussed herein. We first immobilized various known amounts of oligonucleotides to wells 

and washed to remove non-immobilized oligonucleotides. We then added 1:1000 dilution 
of YOYO- 1 in water. We then used a fluorimeter directly without washing. The relation 
between pmoles of oligonucleotide is shown in Figure 15 with open circles. We also 
incubated the TOTO-1 stained immobilized oligonucleotides for ten minutes, washed and 

25 added water, followed by use of the fluorimeter. The washed results are shown in dark 

circles in Figure 15. It can be seen that washing does not signicantly effect the amount of 
staining and that the amount of staining is related to the amount of oligonucleotide. 

We also tested the time course of YOYO-1 staining over the course of one hour. 
We included a control in which no oligonucleotide was immobilized to the plate. The 

30 oligonucleotide-immobLlized plate is shown in open circles in Figure 16 and the control in 

dark circles in that Figure. It can be seen from Figure 16 that after an initial spike, 
relatively constant staining is found. 

We further tested the dose response of constant amounts of oligonucleotides with 
oligonucleotides immobilized in wells (open circles in Figure 17) and control wells with no 
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oligonucleotide (closed circles in Figure 17). The difference between the immobilized and 
non-immobilized is shown in Figure 17 as triangles. It can be seen that a sharp increase in 
fluorescence occurs between I0' 4 and 10" 3 dilution. 

We also tested the reproducability of staining of both oligonucleotide-immobilized 
5 wells and non-immobilized wells. We repeated the experiment five times and graphically 

depicted the data for both oligonucleotide immobilized ( + ) and non-immobilized (-) wells 
in Figure 18. It can be seen that the data was substantially similar for each experiment. 

Thus, the data depicted in Figures 15-18 shows that use of YOYO-1 as a staining 
agent provides a reliable indication of the amount of oligonucleotide present. 
10 A pplication of Sample 

In the preferred methods of the present invention, a sample containing mRNA is 
applied to the insoluble support to which the first polynucleotide probe has been bound. 
Any target mRNA found in the sample is allowed to hybridize with the first polynucleotide 
probe. This can be accomplished by incubation at temperature dependent on a, variety of 
15 factors, as is well known to those with ordinary skill in the art. These factors include the 

length of complementary nucleotide sequences, the ratio of Guanine or Cytosine (GC 
contents) within the entire base content in the complementary nucleotide sequences, NaCl 
concentration in buffer, the number of bases which mismatch in the complementary 
nucleotide sequence, and the type of nucleotide. In a preferred form of this invention, the 
20 following equation can be used to calculate the preferred incubation temperature: 

T inc , 16.6 x log (M) + 0.41 (GC) + 81.5 - 675/n - 15 (oQ. 
In the equation shown above, M is the NaCl concentration (M) in solution, GC 
represents GC contents (%), n represents the length of nucleotide sequences (the number 
of nucleotides), 

25 The incubation time can also be determined according to methods described in the 

Molecular Cloning manual. 

The time for incubation is preferably from 1 hour to overnight, and the sample 
should preferably be gently swung during incubation. Incubation is preferably performed 
in an appropriate buffer solution. The same buffer used to hybridize RNA and DNA in the 
Northern Blot or the Dot Blot Methods can be used. The buffer is preferably prepared in 
a way so as not to contaminate it with RNase. If any RNase contamination is present, the 
activation thereof should be controlled to be as low as possible. RNase-free buffers are 
commercially available, such as within mRNA purification kits or lysis buffer as shown 
before. 
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In order to eliminate RNase activity from water used in the methods of this 
invention, the water is preferably treated with Diethylpyrocarbonate (DEPC). The preferred 
DEPC treatment involves addition of 0.1% DEPC to the water, followed by storage 
overnight at 37©C and sterilization in an autoclave. 
5 Washing Procedures 

Preferably, after incubation, unbound components of the sample are washed from 
the insoluble support. Appropriate solutions for washing include the lysis buffer as shown 
before or the buffer included in the mRNA purification kit. 
Application of Second Polynucleotide Probe 
0 The second polynucleotide probe is applied to the insoluble support after any 

washing of unbound components of the sample, under conditions as described above in 
connection with the hybridization of mRNA in the sample with the first nucleotide probe. 
This is preferably followed by washing, as described above. 
Quantitation 

The amount of mRNA in the sample is quantitated by measuring the amount of 
second polynucleotide probe bound to the insoluble support after application thereto in 
accordance with the present invention. In this regard, a physical or chemical quantity or 
activity of the label on the second polynucleotide can be measured by several techniques, 
including optical density, emitted-ltght intensity, or radiation. The label itself can provide 
this indication or can require other compounds which bind thereto or catalyze the label. 
Other mechanisms for detecting label include the use of compounds that can chemically 
react against the label, detection of colored label, detection of light emission, detection of 
radiation or of catalytic ability. 

In a preferred measurement technique, the label on the second polynucleotide is 
biotin. The presence of this label can be detected by reaction with the enzymes Peroxidase 
or Alkaline phosphatase. These enzymes can be specifically directed to biotin by 
conjugation with avidin or streptavidin. The presence of the enzymes is then detected by 
addition of an appropriate substrate to provide a detectable color-developing or light- 
emitting reaction. The alkaline phosphatase labeled streptavidin can be readily obtained 
from the commercial market. A preferred substrate for alkaline phosphatase is adamantyl- 
1.2-dioxetane phosphate (AMPPD). Upon reaction with the alkaline phosphatase, AMPPD 
will emit light at a wavelength of 447 nm. This light can be detected in accordance with 
techniques known in the art. 
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Tn the reaction between alkaline phosphatase and AMPPD. an enhancer such as 5-N- 
tetradecanoyl-amino-fluoresceincan be added. 5-N-tetradecanoyl-amino-fluorescein has the 
ability to convert light of 477 nm wavelength to 530 nm wavelength which is more readily 
detectable. 

5 Other preferred labels include an antigen, such as digoxigenin or an antibody. An 

antigen can be detected by its ability to bind to an antibody directed thereto. Such 
antibodies, or an antibody directly used to label the second polynucleotide probe, can be 
detected by their ability to bind protein A. The antibody itself can be labeled directly with 
a radionuclide, such as 125 I t or can be labeled by binding thereto of protein A labeled with 
10 the radionuclide. The radionuclide can then be detected in accordance with techniques well 

known in the art. 

Well-known techniques for the detection of label include the detection of label which 
appears by a color-developing reaction by a spectrophotometer. Other such techniques 
include the detection of a light emitting reaction using X-ray film or an instant camera, and 

15 the detection of raditation using a radiation counter. 

The emission reactions are recorded by X-ray film or instant camera film in the dark 
room. The X-ray film which is exposed by emission reactions are recorded as a blot, so this 
shading of the blot can be measured by a densitometer. If one uses an instant camera such 
as a Polaroid, the picture is read by a scanner to decide the location of the blot on the 

20 computer, measuring the shading of the blot using graphic analysis software. 

To measure the amount of target mRNA in an unknown sample solution, an 
arithmetic plot for showing the relationship between a standard solution of target mRNA 
and relative concentration in a blot can be used. The standard solution for target mRNA 
need not be purified target mRNA. Thus, the standards can also be a mixture in which the 

25 amount of target mRNA is known. 

Schematic Example 

A schematic representation of an exemplary method in accordance with the present 
invention is shown in Figure 3. The target mRNA 1 is shown having a polyadenylic acid tail 
2. The first nucleotide probe 3 is immobilized to an insoluble support 4, then mixed the 
target mRNA 1. The target mRNA 1 hybridizes with the first nucleotide probe 3, thereby 
immobilizing the target mRNA to the insoluble support. Then, a second nucleotide probe 
5 which bears a label 6 hybridizes with a portion of the target mRNA other than the portion 
hybridized with the first nucleotide probe. As shown, the second nucleotide probe 5 
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comprises poly (dT) and hybridizes with the polyadenylic acid tail 2 of the target mRNA 1. 
The resulting complex 7 can then be identified by detecting the presence of the label 6. 
Time-Savings of Invention 

Advantageously, the present invention provides a method of measuring mRNA which 
5 can be performed in far less time than previously known methods. Table 1 hereinbelow 

compares the time needed to measure DNA using the method of this invention and a 
Northern blot method. 
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TABLE 1 

Comparison For Measurement Of mRNA 



Present Invention 



Northern Blot Method 



25 



30 



35 



40 



Time 
(hours) 



Time 
(hours) 





— Lysis of Cells 


1 


— Purification of mRNA 


48 




— Hybridization of the 




— Separation of mRNA 




10 


first nucleotide probe 




by electrophoresis 






to mRNA 


1 


with Agarose Gel 
— denaturing of mRNA 
moving the mRNA to 
membrane 


1 
1 


15 


— hybridization of the 




— immobilization of mRNA 
to membrane 

— hybridization of the mRNA 


2 




second nucleotide probe 




and probe 


12 




to mRNA 


1 






20 


— washing and blocking 


0.2 


— washing and blocking 


4 




— detection 


1 


— detection 


1 




Total 


about 5 hours 


about 70 hours 



As shown in Table 1, the methods of this invention generally take 5 hours or less to 
complete, while the Northern blot method generally requires about 70 hours. These times 
do not include the labeling time (about 12 hours) of the nucleotide probes nor the time for 
immobilization of the first nucleotide probe because these constructs can be prepared prior 
to the measurement. 

Particular examples of target mRNA that can be measured to obtain clinically useful 
information are provided below. The examples included herein are intended to illustrate 
certain preferred embodiments of the present invention. These examples are not intended 
to limit the invention in any manner. 
B Receptor 

A potential target for using mRNA measurement to determine a diseased state 
utilizes the mRNA for the p receptor. The p receptor is a protein located in human nerve 
tissue. In particular, abnormalities in the p 2 receptor has been found to be closely 
correlated with asthma. Thus, measuring the mRNA for P 2 receptor can be used to 
determine the pathophysiology of asthma patients, and could also be used to assess the 
effectiveness of anti-asthma auents. 
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EXAMPLE 1 
Measurement of Human p 2 Receptor mRNA 
(Preparation of Calibration Line) 
(D Preparatio n of the First Nucleoside Probe 
5 An oligodeoxyribonucleotide probe containing the nucleotide sequence shown below 

with an amino group attached to its 5' end was synthesized by Genosys, Inc. (TX). In this 
sequence, A, C, G and T indicate adenine, cytosine, guanine, and thymine, respectively. 
Based on computer analysis (DNASIS, Hitachi Software Engineering America, Ltd., CA), 
this sequence was determined to be homologous to the human p 2 adrenergic receptor 
10 mRNA. 

5'-NH 2 -ATG CTG GCC GTG ACG CAC AGC A-3' (Seq ID NO: 6) 
< 2) Immobilization of the First Nucle otide Probe Onto Insoluble Supports 

Both EDC and sulfo-NHS (Pierce, IL) were dissolved in DEPC-treated water at 
concentrations of 20 mM and lOmM, respectively. EDC/Sulfo-NHS solution was then 
15 prepared by mixing equal volumes of both EDC and sulfo-NHS- The first nucleotide probe 

was dissolved in DEPC-treated water at a concentration of lu.g/ul and then mixed with the 
EDC/Sulfo-NHS solution in the ration 1:25 (Vol:Vol). 

50ul of this probe solution was added to each well of a microtiter plate (MS-3796F, 
Sumitomo Bakelite, JAPAN), which were known to express carboxyl groups on their surface. 
After incubation at room temperature overnight, the reaction solution was removed by 
aspiration. 

(3) Preparation of mRNA 

DNA plasmid pUCl8 containing the human p 2 adrenergic receptor cDNA (pTF. 
American Type Culture Collection) was digested with Pstl and EcoRl and mixed with the 
pBC phagemid (Stratagene, CA) which had been predigested with the same restriction 
enzymes. The resultant plasmid was linearized by Pstl extracted once with phenol- 
chloroform, precipitated in ethanol, and used to transcribe the human P-> receptor mRNA 
using the Riboprobe system and T7 RNA polymerase (Promega, WI). The transcribed 
human p 2 receptor mRNA was treated with RNase-free DNase to digest the remaining 
template cDNA. then purified with one cycle of phenol extraction and ethanol precipitation. 
The resultant mRNA showed up as a single band on agarose (FMC byproducts, ME) gel 
electrophoresis. Messenger RNA concentration was measured by a spectrophotometer at 
260 and 280 nm against standard mRNA solutions prepared at concentrations of 25 pc, 250 
pg, 2.5 ng. 25 ng and 250 ng. respectively. 
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( 4 ) Hybridization of the First Nucleotide Probe to mRNA 

In order to remove RNase from the well, the first nucleotide probe-immobilized 
wells were treated with 250 \il of lysis buffer containing 0.5 M NaCl at 45<>C for one hour. 
The buffer was removed from individual wells by aspiration, and 50 *il of lysis buffer 
5 containing varying concentrations of standard human p 2 receptor mRNA was added into 

each well. These solutions were incubated at 59° C for one hour to allow hybridization. 

(5) Preparation of the Secnn ^ Nucleotide Probe 

The second nucleotide probe was a 15-mer oligo(dT) labeled with biotin at the 5' 

end. 

10 < 6 > Hybridization of the Labeled Secon d Nucleotide Probe to the PoMA) tail of mRNA 

After hybridization of the first nucleotide probe and mRNA described in step (4), 
the hybridization solution was removed by aspiration and the individual wells were washed 
with 250 ul of lysis buffer. Fifty nl of lysis buffer containing 0.5 M NaCl and 1 nl of the 
biotinylated second probe solution (35 pmol/jil) was added into each well, and incubated 
15 at room temperature for an additional one hour. ^ 

< 7 > Measurement of Che mical Activities of the Labeled Second Nucleotide Probe 

Following the hybridization described in step (6), the hybridization solution was 
removed by aspiration and wells were washed once with 250 ul of lysis buffer. A blocking 
buffer consisting of (0.05%(w/v) Tween 20, 500 mM NaCl, 100 mM Tris-HCl, pH 7.5) was 
added into each well and incubated at room temperature for five minutes to reduced 
nonspecific binding. Reaction solutions were then removed by aspiration, 50 ^1 of blocking 
solution containing 1/5000 vol. of alkaline phosphatase-conjugated streptavidin solution 
(Clontech, CA) was added into each well, and incubated at room temperature for 30 
minutes. Each well was then washed with 250 \il of washing buffer containing 500 mM 
25 NaCl, 100 mM Tris-HCl, pH 7.5, and 50 |xl of Lumiphos 530 solution (Boehringer- 

Mannheim, IN) whole plates were placed on X-ray film (XAR5, Kodak, NY) in specific 
holders (MYC, Kodak, NY). X-ray film was exposed to a generated chemiluminescent light 
at room temperature for one hour, and then developed using a film processor (QX-400, 
Konica, Japan). Relative intensities of individual wells on X-ray films were quantified bv 
30 densitometry (model 620, Bio-Rad, CA). 

(8) Results 

The results of Example 1 are shown in Figure 8. The inset (A) shows results of an 
X-ray film exposed to chemiluminescent light during the detection of mRNAs by the 
invented method; (B) shows a densitographic representation of the same image as in (A): 
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(C) exemplifies a calibration line of the mRNA in which the X-axis represents 
concentrations of mRNA and the Y-axis represents the relative intensity of 
chemtluminescence in an individual weU . Wells 1, 2, 3, 4 and 5 of Figure 8 correspond to 
the amounts of tested mRNA at 25 pg, 250 pg t 2.5 ng, 25 ng and 250 ng, respectively. As 
5 shown in (A), mRNAs were detected in amounts over 25 pg by the invented method. The 

calibration line shown in (C) shows the high linearity of signal when increasing 
concentrations from 25 pg to 250 ng of mRNA. 

COMPARISON 1 
Quantification of mRNA bv Northern Blot 
10 (1) Agarose Get Electrophoresis of mRNA 

Ten mM of sodium phosphate buffer, pH 7.0 containing 0,1% DEPC was heated 
to 37oC overnight and autoclaved. A 1% agarose solution was prepared using this 10 mM 
DEPC-treated phosphate buffer. The solution was chilled to ca. 50<>C, and iodine acetate 
was added to 0.2% (W/V). A gel slab was formed (6.5 cm x 10 cm) with combs and placed 
15 at room temperature for one hour to allow hardening. Five \il of DEPC-treated water 

containing 250ng, 25 ng, 2.5 ng, 250 pg or 25 pg of mRNA described in example 1.(3), were 
mixed with 7 |il of loading buffer (125 *il DMSO, 42 \il 40% glyoxal, 3 |il 1 M sodium 
phosphate buffer, pH 7.0), and incubated at 50©C for one hour. Samples were applied to 
the gel, and electrophoresed in a solution of DEPC-treated 10 mM phosphate buffer at 
20 constant 6V/cm for 15 minutes without circulation. An additional one hour of 

electrophoresis was performed with circulation. 

Following electrophoresis, gels were immersed in 50mM sodium hydroxide containing 
0.5 jig/ml ethidium bromide (Sigma, MO) for 20 minutes, and 0.1 M Tris-HCl buffer, pH 
7.5 for 40 minutes. Gels were then transferred into fresh 0.1 M Tris-HCl buffer and 
25 photographed under UV light. 
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(2) Northern Transfer of mRNA Onto Membranes 

Filter paper (3MM Chr, Whatman, NJ) and a nylon membrane (Magnagraph, 
Micron Separations, MA) were immersed in sterile water for 20 minutes and then placed 
in lOx SSPE (1.5 M NaCL 115 mM sodium phosphate, 11.5 mM EDTA in water, pH 7.4 
5 adjusted with 10 N NaOH). A blotting sponge (Stratagen, CA) was also immersed in lOx 

SSPE. The 3MM filter, nylon membrane, agarose gel, and sponge were assembled from the 
bottom in a Posi-blot Pressure blotter (Staratagen, CA), to cause the mRNA to transferred 
onto the nylon membrane by positive pressure (835 mmHg for 45 minutes). The nylon 
membrane was dried, and the mRNA was permanently immobilized onto the nylon 
10 membrane by UV light cross linkage at 120 mjoules (Stratalinker 1800, Stratagen, CA). 

(3) Preparation of Biotinvlated cDNA Probe 

DNA probes were prepared from p 2 receptor cDNA by using a DNA labeling kit 
(Boehringer-Mannheim, IN) and biotin-labeled deoxyuridylic acid (Clontech, CA). In brief, 
a P 2 receptor cDNA-containing plasmid, pTF was digested with Pstl and EcoRl, The 

15 resultant plasmid and cDNA were separated by agarose gel electrophoresis. A band of p 2 

receptor cDNA was cut out and removed from the gel using the Gene clean kit (Bio 101, 
CA). 0.1 |ig of above cDNA dissolved in 5 *il of water was incubated at 95cC for 10 
minutes, and then incubated with 2 ^1 of hexanucleotide mixture, 2 \il of dNTP and 1 \il of 
Klenow enzyme (DNA labeling kit, Boehringer-Mannheim, IN) at 37oC for one hour. 

20 Following incubation, the reaction was terminated by addition of 2 til of 0.2 M EDTA, pH 

8.0, 2.5 nl of 4.0 M LiCI and 75 \xl of 100% ethanol, and cooled at -70<>C for 30 minutes to 
precipitate cDNA. After centrifugation, the pellet was washed once with 70% ethanol and 
then dissolved in 50 \il of water. 

(4) Hybridization of Biotinvlated cDNA Probe to mRNA 

25 The mRNA containing membranes were incubated in hybridization buffer containing 

0.5 M NaCl (ECL, Amersham, IL) at 45<>C for one hour. Biotinylated cDNA probe 
described above were first heat denatured at 95oC for 10 minutes, immediately cooled on 
ice, and then added to the hybridization buffer. Hybridization was continued at 40oC 
overnight. 

30 (5) Chemical Activity Measurements of Labeled cDNA Probe 

After hybridization, membranes were washed twice in 0.5x SSPE solution containing 
36% (W/V) urea. 0.4% (W/V) SDS and 0.5 M NaCl at 65©C for 15 minutes each, followed 
by washing twice in 2x SSPE with 0.5 M NaCl at room temperature for five minutes each- 
Membranes were then incubated with 100 mM Tris-HCl buffer, pH 7.0, containing 2 c /c 
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blocking reagent (Boehringer-Mannheim, IN) and 150 mM NaCl at room temperature for 
three hours. 1/5000 vol. of alkaline phosphatase-conjugated streptavidin (Clontech, CA) was 
added to the above solution and incubated at room temperature 30 minutes. Membranes 
were washed twice with 100 mM Tris-HCl, pH 7.0, containing 150 mM NaCl at room 
5 temperature for 30 minutes each followed by one washing with alkaline phosphate buffer 

(100 mM NaCl, 5 mM MgCl 2 in Tris-HCl buffer, pH 9.5) at room temperature for 2 
minutes. Membranes were then soaked in Lumi-Phos 530 (Boehringer-Mannheim, IN), and 
quickly placed between two sheets of transparent plastic film. Chemiluminescent membrane 
signals were exposed to X-ray film (XAR5, Kodak, NY) in specific holders (MYC, Kodak, 
10 NY) at room temperature for one hour. 

(6) Results 

The Northern blot results of Comparison 1 are shown in Figure 8, in which the 
chemiluminescent signals of mRNA were exposed to an X-ray film. Lanes 1, 2, 3, 4, and 5 
of Figure 8 correspond to 25 pg, 250 pg, 2.5 ng, 25 ng and 250ng of applied mRNA, 

15 respectively. Because this cDNA probe is approximately 100 fold larger than the 

oligonucleotide probe used in Example 1, the amount of incorporated biotin is also 100 fold 
more than the oligonucleotide. This result indicates that the cDNA probes used in 
Comparison 1 are approximately 100 fold more sensitive than the oligonucleotide probe. 
However, as seen in Figure 8, a conventional Northern blot can only detect samples 

20 containing greater than 25 ng of mRNA. This is far less sensitive than the invented method 

as exemplified by Figure 8. Also, the invention as described by Figure 8 used less sensitive 
oligonucleotide probes when compared to the cDNA probes used on the Northern Blot. 

EXAMPLE 2 

Detection of Human p 2 Receptor mRNA From Unknown 
25 Samples And Evaluation of Reproducibility 

U) Preparation of the First Nucleotide Probe and Its Immobilization Onto the 
Insoluble Support 

The first nucleotide probe was prepared and immobilized onto an insoluble support 
as described in Example I. 

30 
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(2) Preparatio n of Standard mRNA 

mRNA was prepared as described in Example 1. 

(3) Preparation of Cell Lvsates 

Human HL60 promyelocyte leukemia cells were collected by centrifugation, washed 
5 with PBS (8 g NaCl, 0.2 g KCl 1.44 g Na 2 HP 4 and 0.24 g KH 2 P0 4 in 1 liter of water, pH 

7.4). Cells were then resuspended in lysis buffer at a concentration of 1 x 10 7 cells/ml. The 
solution was repeatedly passed through a sterile plastic syringe fitted with a 21 gauge needle 
and incubated in a slow-shaking waterbath at 45<>C for one hour. 

(4) Hybridization and Chemilumines cent Petectfo n 

10 As described in Example 1, hybridization of the first nucleotide probe, preparation 

of the second nucleotide probe, hybridization of the labeled second nucleotide probe to the 
poly(A) tail of mRNA, and chemical activity measurement of the labeled second nucleotide 
probe was carried out. 

(5) Preparation of Calibration Line 

15 As described in Example 1, a calibration line of known p 2 receptor mRNA 

concentrations was created (Figure 9). The calibration line expresses a linear co- 
relationship between mRNA concentrations mapped to a logarithmic scale and the relative 
chemiluminescent intensity read from X-ray film. 

(6) Evaluation of Reproducibility 
Concentrations of p 2 receptor specific mRNA from HL60 cell lysates were 

determined three times. In Table 2, each concentration of (J 2 receptor specific mRNA was 
determined from the calibration line described in section (5) above. The mean and standard 
error (S.E.) of data were calculated. Cell numbers were counted by a phase hemacytometer 
(American Scientific Products, IL). As shown in Table 2, three separate assays exhibited 
25 similar mRNA values at a range of 0.96 to 1.12 fg/cell. This suggests that the invented 

method provides reproducible method of quantifying specific mRNA levels. 



20 
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TABLE 2 

Concentrations of p 2 Receptor-Specific mRNA in HL60 Cells 



No. of assay mRNA Concentration [fg/ceU] 



10 



1 1.01 

2 0.96 

3 1.12 



mean ± S.E. 1.03 ± 0.04 

G Proteins 

Cell surface receptors for hormones and neurotransmitters are known to be coupled 
to intracellular heterotrimeric GTP-binding proteins (G proteins) composed of a, p and t 
15 subunits. Once receptors are activated by specific ligands. receptor-coupled G proteins 

transduce signals to intracellular secondary effector systems, such as adenylyl cyclase, 
phospholipase C, and ion channels. 

Recently, the molecular mechanisms of the interaction between receptors and G 
proteins have become less clear, because extensive molecular cloning has identified more 

20 subclasses of G protein a subunits than there are known cellular functions of G proteins. 

Furthermore, it has been found that a single receptor is also capable of coupling to multiple 
G proteins. For example, histamine H 2 receptors in human HL-60 cells were coupled to two 
different G proteins, which activated adenylyl cyclase and phospholipase C respectively. It 
has also been reported that GABA-B receptors were capable of coupling to G 0 , G 0 * and G 

25 i proteins, but not G^ proteins. Moreover, CHO cells have been shown to express multiple 

G proteins, which activate the same phospholipase C pathways but are coupled selectively 
to different receptors. Some receptors may alter their G-protein subclass association 
depending on the state of differentiation of the cell or on the particular phase of the cell 
cycle.One example would be the mRNA coding for G protein. G protein binds to and 

30 controls the function of various receptors in the cell membrane. It catalyzes the intracellular 

reactions of cAMP and calcium ion free reaction systems. These molecules are used in 
various cellular transmitter systems. 

G proteins are believed to be involved in causing various disease states. For 
example, a genetic deficiency of G s proteins is the molecular basis of hereditarv 
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osteodystrophy. Pituitary tumors in acromegalic patients have been shown to contain 
mutant Gs proteins. G proteins are also involved in invasive and metastatic melanoma cells. 
Rat models of streptozotocin-induced experimental diabetes suggest that the levels of 
mRNA for various subclasses of Got proteins are significantly altered from normal control 
5 rats. Furthermore, cellular functions of pertussis toxin-sensitive G proteins were shown to 

be significantly impaired in atherosclerotic porcine coronary arteries, while G protein 
function in leukocytes of patients with mania was hyperfunctional. Thus, those diseases 
which are known to be related to a malfunction of the G protein include atherosclerosis, 
bipolar disorder, diabetes mellitus, melanoma, and pituitary tumors. 

10 Currently available immunological detection methods (Western blots) and mRNA 

detection methods (Northern blots) are not sensitive and require a lot of cellular material, 
making it difficult to study the role of G proteins in disease. It is not easy to ascertain the 
quantity of expressed G protein. If the mRNA for the G protein was measured, it could be 
used to recognize the pathophysiology of patients having one of the foregoing ailments. The 

15 effects of agents designed to arrest or control these conditions could also be measured using 

this technique. G protein consists of a, p and y subunits. The a subunit is known to have 
several allelic variations. Thus, by measuring each variation of G protein mRNA, the 
amount of expressed protein for each variation can be estimated. 

EXAMPLE 3 

20 Quantification of Human G Protein a 

Suhunit-Specific mRNA 

(1) First Nucleotide Probe Preparation 

The first nucleotide probes used against Gs protein-specific mRNA and Gi-2 protein- 
specific mRNA were oligodeoxyribonucleotides. These nucleotide sequences, shown below, 
25 contain an amino group on their .V end and were synthesized by Genosys (Texas). 

The probe sequence used for quantifying Gs mRNA is shown below (Seq ID No. 1): 

5'-NH r TTC ATC CTC CCA CAG AGC CTT G-3* 

30 The sequence of probe used for quantifying Gi-2 mRNA is shown below (Seq ID No. 



5'-NH,-ATG GTC AGC CCA GAG CCT CCG G-.V 
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(2) Immobilization of t he Second Nudleotide Probe Onto Insoluble Supports 

The first nucleotide probes were dissolved in DEPC-treated water at lug/ul. Probes 
were then mixed with the EDC/Sulfo-NHS solution described in Example 1 Paragraph (2) 
in a ratio of 1:25 (Vol: Vol). A 50 ul sample was added to each well of a Sumilon microliter 
5 plate (MS-3796F, Sumitomo BAkelite, JAPAN) and incubated overnight at room 

temperature. Following incubation, the reaction solution was removed with an aspirator. 

(3) Preparatio n of Standard mRNA 

A plasmid vector pGEM2 containing rat Gs protein cDNA (provided by Dr. R. R. 
Reed, Johns Hopkins University, USA.) was linearized by digestion with Nhel (Promega, 
10 WI). Gs protein-specific mRNA was prepared by the method described in Example 1 

Paragraph (3) using T7 RNA polymerase (Promega, WI). The resultant mRNA 
concentrations were determined by spectrophotometry and subsequently diluted with DEPC- 
treated water as described in Example 1 Paragraph (3). 

(4) Preparation of Cell Lvsates 

15 Human B lymphocyte-derived IM9 cells were centrifuged, washed with PBS, and 

resuspended in lysis buffer at 1 x 10 7 cells/ml as described in Example 2. Cells were then 
lysed by repeated passage through a sterile plastic syringe fitted with a 21 gauge needle, and 
then incubated in a slow-shaking waterbath at 45©C for one hour. 

(5) Hybridizat ion of the First Nucleotide Probe to mRNA 

-° In order to remove RNase contamination in the microliter well, 250 \l\ of lysis 

buffer was added to each well at 45oC for one hour as described in Example 1. The buffer 
was aspirated, and 50 ul of lysis buffer containing 0.5 M NaCl and various concentrations 
of human Gs protein mRNA was added into each well. Each sample was incubated at 51°C 
for one hour to allow hybridization as described in Example 1. 
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(6) Hybridization of the Labeled Second N y clenHHo Probe to a PolvtAV Tail of mR m a | 
and Measurement of Chemical Activities of th<> f^ ied Second Nucleotide Prnh* 

After hybridization of the first nucleotide probe, the hybridization solution was 
aspirated off and individual wells were washed once with 250 ul of lysis buffer. Fifty ul of 
5 lysis buffer containing 0.5 M NaCl and 1 pi of the biotinylated second nucleotide probe 

solution (35 pmol/ul) was added into each well, and incubated at room temperature for one 
hour to allow hybridization. 

Chemical activities of the labeled second nucleotide probe was measured as 
described in Example 1. 
10 (?) Preparation of Calibration Line 

The calibration line of standard Gs protein-specific mRNA is shown in Figure 10. 
The calibration line expresses a relationship between the common logarithm of mRNA 
concentrations and the relative intensity of chemiiuminescent signals exposed to X-ray film. 
(8) Evaluation of Reproducibility 

Concentrations of Gs protein and Gi-2 protein specific mRNAs from IM9 cell lysites 
were measured six times to test reproducibility of the present invention. The methods used 
for Gi-2 were similar to that described above for Gs, with the incubation temperature used 
for hybridization of Gi-2 mRNA being 59°C. Table 3 lists the mRNA concentrations as 
obtained from the calibration line, and, the mean, and the standard error. CeU numbers 
from each sample were counted with a phase hemacytometer (American Scientific Products, 
IL). As listed in Table 3. all six assays reported similar mRNA concentrations for both Gs 
and Gi-2 mRNA. This suggests that the current invention provides a reliable, reproducible 
means of measuring specific mRNA concentrations. 
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TABLE 3 



No. of Assay mRNA Concentration [fg/cell] 



10 



Gs 


Gi-2 




1 


24.8 


4.6 


2 


10.6 


8.8 


3 


18.8 


7.2 


4 


35.5 


6.8 


5 


13.6 


11.5 


6 


33.0 


13.7 


mean ± S.E. 


22.7 ± 3.8 


8.77 ± 1.24 



EXAMPLE 4 

Simultaneous Quantification of Various mRNAs 
Derived F rom Cell Lines hv the Invented Method 
20 in Conjunction with Reverse PCR Method 

Simultaneous Detection of Gs an d Gi-2 Protein-Specific mRNAs in Various Cell 
Lvsates bv the Invented Method 

Using the method described in Example 3, specific mRNAs for both Gs and Gi-2 
protein from various cell lysates, such as Jurkat human T-ceUs, IM9 human B cell, U937 
25 human monocytes and HL60 human granulocytic cells were analyzed. Chemical activities 

of the labeled second probe were exposed to Polaroid film (667, Polaroid, MA) in a specific 
holder (model 901, Tropix, CA) at room temperature for 5 seconds, instead of exposing to 
X-ray films as described in Example 3. 

< 2) Simultaneous Detection of Gs and Gi-2 Protein-Specific mRNAs bv the 

30 Conventional Reverse PCR Method 

(a) cDNA Pr eparation and Amplification bv PCR 

mRNA was purified using the FastTrack kit (Invitrogen, CA) directly from various 
cell lvsates as described in Example 3 Paragraph (4). Samples were dissolved in 10 |il 
DEPC-treated water, and cDNA was synthesized using the Superscript kit (Gibco/BRL Life 
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Technologies, MD). Resultant cDNA was extracted once with phenol, precipitated in 
ethanol, and dissolved in 10 |al of DEPC-treated water. 

One til of solution containing cDNA was mixed with 1 ^il each of 10 mM dATT\ 
dCTP, dGTP and dTTP (Pharmacia LKB Biotechnology, NJ), 1 \il of 25 mM MgCl 2 , 1 *il 
5 each of G protein-specific PCR primers, 0.5 \xl of Taq polymarese (Promega, WI), and 5 ^1 

of PCR buffer (Promega) to a final volume of 50 Sense and antisense G protein-specific 
PCR primers are exibited below 

AGCACCATTGTGAAGCAGATGA (sense) (Seq ID No. 3) 

CTCTG G CCTCCC AC ATCAAAC A (5' to 3' antisense) (Seq ID No. 4) 
10 These sequences are known to be highly conserved among various G proteins, with 

this pair of primers being known to amplify Gs, Gi-1, Gi-2, Gi-3 and Go protein specific 
cDNA from data gathered in preliminary experiments. Followed by pre-denaturation at 
95<>C for 10 minutes, PCR was carried out in a DNA thermal cycler (Perkin-Elmer Cetus, 
CT) with 30 cycles of annealing at 55©C for 1.5 minutes, extension at 72°C for 4 minutes. 
15 and denaturation at 95°C for 1.5 minutes, respectively. 

(b) Agarose gel electrophoresis of amplified cDNA 

A 1.2% agarose gel containing 0.00005% ethidium bromide, 0.001 M EDTA and 
Tris-acetate, pH 8.0 was prepared. Ten \i\ of each test solution was mixed with 2 \xl of gel- 
loading buffer (0.25% bromophenol blue, 0.25% xylene cyanol and 15% Ficoll (Type 400, 
20 Pharmacia LKB Biotechnology, NJ; in water) and heated at 65°C for five minutes prior to 

loading. Electrophoresis was performed at 6 V/cm for one hour. After electrophoresis, the 
cDNA bands were examined under UV light illumination. 

The gel was then immersed in 0.25 M HC1 for 30 minutes (depurination), 0.5 M 
NaOH containing 1.5 M NaCI for one hour (denaturation), followed by 1 M Tris-HCl, pH 
25 7.5 containing 1.5 M NaCI for one hour (neutralization). 

(c) Transfer of Amplified cDNAs onto Membranes (Southern BloO 
Amplified cDNAs were transferred onto nylon membranes and immobilized by UV 

light as described in Comparison 1(2). 

(d) Preparation of Biotin-Labeied cDNA Probes 

30 Ten ng each of Gs and Gi-2 cDNA clones were amplified by PCR as described in 

(2)(a) above, with the modification that 20% of dTTP was replaced with biotin-labeled 
dUTP (Clontech). In preliminary Southern blot analysis, these biotin-labeled PCR products 
have been shown to specifically hybridize to Gs and Gi-2, respectively. 

(e) Hybridization of Labeled cDNA Probes to Southern Membranes 
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As shown in Comparison Example 1 Paragraph 4, biotin-labeied PCR products were 
heat-denatured at 95oC for 10 minutes, then hybridized to Southern membranes, on which 
PCR-amplified cDNAs from cell lysates were immobilized. 

(0 Chemical Activity Measurement of Labeled cDNA Probes 
5 Chemical activity of labeled cDNA probes was measured as described in Comparison 

1 Paragraph (5). 
(3) R^ltS 

The results of Example 4(1) are shown in Figure 11. Polaroid film was exposed to 
chemiluminescent light derived from an oligo-(dT) probe by the invented method, in which 
10 Gi-2 and Gs protein mRNA from various cells were trapped onto the first oligonucleotide- 

immobilized plastic plate. Phosphate-buffered saline (PBS) was used as a control. As 
shown in Figure 11, mRNA was detected from all cell lines. 

The results of Example 4(2) are shown in Figure 12, in which Gi-2 and Gs cDNA 
probes were used on a Southern Blot. PCR-amplified G protein specific DNA from four 
15 different cell lines were immobilized onto a membrane. Figure 12 is a copy of the X-ray 

film detailing the chemiluminescent results of this probe. As shown in Figure 12, mRNAs 
are detected in all cell lines. Therefore, the invented method provides a high enough 
reliability to be used for quantification of mRNA. 

EXAMPLE 5 

20 Methods for Amplifying G Proteins With PCR Primers 

(1) Materials 

The cDNAs of rat G protein a subunits (G M , G^. G U3 , G s and GJ were provided 
by Dr. R.R. Reed (Johns Hopkins Univ.. MD). XZAP libraries of rat pituitary and intestine 
were provided by Dr. D.G. Payan (Univ. Calif. San Francisco). Kirsten murine sarcoma 

25 virus transformed rat kidney cells (KNRK), human 1M9 B-lymphocytes and human Jurkat 

T-lymphocytes were obtained from American Type Tissue Culture Collection, Rockville, 
MD). Cell culture media. Superscript (Gibco/BRL, Gaithersburg, MD), reagents for PCR 
(Promega, Madison, WI), ECL (Amersham. Arlington Height, IL), Genius, Lumi-Phos 530 
(Boehringer-Mannheim, Indianapolis, IN), FastTrack (Jnvitrogen San Diego, CA), k gtlO 

30 library of human HL-60 cells, biotin-dUTP, alkaline phosphatase-conjugated streptavidine 

(Clontech, Palo Alto, CA), dNTP (Pharmacia, Piscataway, NJ) were obtained from the 
designated suppliers. Other chemicals were purchased from Sigma (St. Louis. MO). 

(2) Cell culture 
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KRNK cells were grown in Dulbecco's modified Eagles medium containing 10% fetal 
calf serum, 100 U/ml penicillin and 100 ^g/ml streptomycin at 37°C in 5% C0 2 /95% air. 
Cells were fed every other day and passaged at 70-90% confluency with 0.1% trypsin in 
CA 2 + -Mg 2 + -free saline containing 0.02% EDTA. IM9 and Jurkat cells was grown in RPMI 
5 1640 containing 10% fetal calf serum, 100 U/ml penicillin and 100 ng/ml streptomycin ait 

37° C in 5% C0 2 /95%. Cell viability was more than 90% as assessed by the exclusion of 
trypan blue. 

(3) Primer design 

Rat clones of Ga proteins (G M (RATBPGTPB), (RATBPGTPA), G^ 

10 (RATBPGTP), G s (RATBPGTPD), and G 0 (RATBPGTPC) were retrieved from GenBank 

release 65.0 (HIBIIO, Hitachi America, Brisbane, CA). The nucleotide sequence similarity 
among these clones were then analyzed by the multiple alignment program (DNASIS, 
Hitachi). We have initially identified 7 highly conserved areas among them. These 
conserved nucleotide sequences were then analyzed against all mammalian sequences in 
15 GenBank in order to identify other similar sequences. The designed oligonucleotides (Seq. 

ID No. *) were synthesized by Genosys Biotechnologies (Woodlands, TX), and suspended 
in water at 100 pg/ml. 

(4) P£R 

One til of the template DNA was mixed with 1 mM each of dATP, dGTP, dCTP and 
20 dTTP, 1 ill of each PCR primers, 1 nl of 25 mM MgCI 2 . 5 \i\ PCR buffer, and 0.5 ^1 of Taq 

polymerase (18). PCR was then carried out in a DNA thermal cycler (model 480, Perkin- 
Elmer Cetus, Norwalk, CT) with 30 cycles of annealing temperature at ranging from 37°C 
to 65° C for 1.5 min, 72° C extension for 4 min followed by 95° C denaturization for 1.5 min. 
In separate experiments, 35% of dTTP was replaced with biotin-dUTP in order to prepare 
25 biotin-labeled probes. 

(5) Southern blot 

PCR products were separated by electrophoresis in 1.2% agarose, and stained with 
ethidium bromide (19). Gels were then depurinated in 0.25 N HC1 for 30 minutes and 
denatured in 0.5 N NaOH containing 1.5 M NaCl for 30 minutes. The gels were then 
30 neutralized with 1.0 M Tris, pH 7.6 containing 1.5 M NaCI for 30 minutes. Gels were then 

placed onto nylon membranes (MagnaGraph, MSI, Westboro, MA) prewetted in 10X SSPE 
for 10 min, and DNA was transferred onto membranes by positive pressure at 75 mmHg for 
60 minutes (Posiblot, Stratagene. La Jolla. CA). The DNA from the gel was then cross- 
linked to the membranes with ultraviolet light at 120 mjoules (St rata I inker. Stratagene). and 
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the membranes were incubated with hybridization buffer (ECL) containing 5% blocking 
reagent (ECL) and 0.5 M NaCl at 40°C for more than 1 hour. Heat denatured biotin- 
labeled PCR probes were then added, and hybridization was continued overnight. The 
membranes were washed four times for 15 minutes each time with primary wash buffer (0.5x 

5 SSPE, 36 w/v% urea, 0.4w/v% SDS) at 45-65° C, then washed twice for 5 minutes with 

secondary wash buffer (2x SSPE) at room temperature, and were incubated with the 
blocking buffer (Genius) for at least 3 hours at room temperature. Alkaline phosphatase- 
conjugated streptavidine (1:5,000 dilution) was then added, and incubation was continued 
for an additional 30-60 minutes at room temperature. The membranes were washed four 

0 times for 15 minutes with buffer A (100 mM Tris* pH 7.5, 150 mM NaCl) at room 

temperature, were washed for 2 minutes once with buffer C (100 mM Tris, pH 9.5, 100 mM 
NaCl, 50 mM MgCI 2 ), and soaked in Lumi-Phos 530 for approximately 1-2 minutes. The 
membranes were then wrapped with transparency films, and chemiluminescent signals were 
allowed to expose X-ray films (XAR-5, Kodak, Rochester, NY) for between 10 minutes and 

5 1 hour. 

(6) mRNA preparation and cDNA synthesis 

The cells were washed with phosphate buffered saline three times, homogenized in 
lysis buffer (FastTrack), and then incubated at 45° C for 1 hour to eliminate any RNase 
activity. NaCl concentrations were adjusted at 0.5 M, and an oiigo (dT) cellulose tablet was 

D added to lysis buffer. Incubation was then continued at room temperature for an additional 

40 minutes. After oiigo (dT) cellulose was washed with binding buffer (FastTrack) four 
times, bound mRNA was eluted with DEPC-treated water. Concentrations of mRNA were 
determined in a spectrophotometer (Hitachi, U-2000, Irvine, CA) at OD 260 . The first strand 
cDNA was synthesized from a template mRNA in the presence of 50 mM Tris, pH 8.3, 75 

5 mM KC1, 3 mM MgCl 2 , 10 mM DTT, 0.5 mM each of dATP, dCTP, dGTP, and dTTP, poly 

(dT) as a primer, and reverse transcriptase (Superscript) at 37° C for 1 hour. Second strand 
cDNA was then synthesized in the same tube, containing 25 mM Tris, pH 7.5, 100 mM KC1, 
5 mM MgCI 2 , 10 mM (NH 4 ) 2 S0 4 , 0.15 mM P-NAD + , 250 u.M each of dATP, dGTP. dCTP 
and dTTP, 1.2 mM DTT, 65 U/ml DNA ligase, 250 U/ml DNA polymerase, and 13 U/ml 

) RNase H (Superscript) for 2 hours at 16° C. Synthesized cDNAs were then extracted once 

with an equal volume of phenol:chloroform:isoamyl alcohol (25:24:1), precipitated with 
ethanol, and resuspended in H 2 0. 

(7) Graphic presentation 
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Data on Polaroid films and X-ray films was scanned by Stratascan (Stratagene) with 
optimization of signal-to-noise ratio, then edited with desk top publishing software 
(PageMaker, Aldus, Seattle, WA). 

EXAMPLE 6 

5 Characterization of Cellular Components of G Protein 

a Subunits Using PCR 

(1) Introduction 

Using the methods of Example 5, we have now identified and characterized two 
highly conserved oligonucleotide sequences among five different a subunits of G proteins, 
10 SEQ ID NO:7 and SEQ ID NO:8, which can be used as PCR primers. We have found that 

the sequences of all the subclasses of G proteins can be amplified under the same PCR 
conditions using these two oligonucleotides as PCR primers, including both G protein 
sequences obtained from a mixture of rat Get protein clones and cDNAs derived from 
various human tissues. 

15 Although G proteins have been analyzed extensively from a biochemical and 

immunological point of view using various antibodies, antibody production without any cross- 
reactivity among various subclasses or with high species specificity has been quite difficult 
to obtain. Therefore, recent experiments have focused on Northern blot analyses to identify 
G protein-specific mRNA from various tissues or cells in different species. However, 

20 Northern blots require experienced handling and protection from RNase contamination, in 

addition to a large amount of starting cellular materials. 

In contrast to these conventional methods, PCR technology is more convenient and 
practically useful, because it requires less material than a Northern blot analysis and has 
great sensitivity. However, it is difficult for PCR to quantify the amount of DNA or mRNA 

25 in starting materials. Interestingly, the final PCR products obtained using the novel G- 

protein PCR primers of the present invention reflect the relative composition of each of the 
subclasses of Ga proteins present in the starting materials. This is probably because the five 
different Ga proteins cDNAs are amplified at a similar rate with a single set of PCR 
primers under the same PCR conditions. If known mixtures of each of the subclasses of Got 

30 protein clones are assayed together with unknown test samples, as shown in Figure 13, the 

relative composition of Ga proteins can be determined fairly precisely. Therefore, the 
present method is ideal for the characterization of Ga proteins in various tissues and cells. 

Performing PCR with the primers of the present invention is also useful in clinical 
and diagnostic assays in the detection of disease. Since G protein abnormalities have been 
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associated with hereditary diseases, cancer, forms of diabetes, and other diseases, the 
present PCR primers for detecting and quantifying G proteins can be used to detect these 
diseases and assess their severity. 

In the present study, we have identified G^, G { _ 2 * G^, G s and G 0 using the PCR 
5 primers of the present invention. Although recent cloning has identified more subclasses 

of G proteins, all of these newly identified G protein cDNAs showed high degree of 
homology to other known G proteins. Therefore, it is expected that the primers of the 
present invention will amplify these subclasses as well, and that the present PCR the present 
PCR technique can also be applied to these new G proteins. Moreover, this PCR method 
10 can be utilized to clone unique G protein genes as well. 

(2) Primers 

We have designed two 22-mer oligonucleotides (G 2 and G 4 --SEQ ID NOS:700 and 
701) as PCR primers. As shown in Table 4 below, these oligonucleotides contain sequences 
which are highly conserved among five different Got protein cDNAs, having only 0-4 base 

15 mismatches per sequence. No mismatch was found in the 4 bases at the 3 1 end of the G 2 - 

sense and G 4 -antisense sequences. Furthermore, G 2 and G 4 have no self-complementary 
sequences more than 3 base pairs in a row (data not shown). In order to analyze whether 
G 2 and G 4 are common to all the Got proteins, but not to other unrelated sequences, a 
homology search (DNASIS) of G 2 and G 4 sequences was carried out against all mammalian 

20 sequences in GenBank. As a result, G 2 and G 4 were found to be common to all the types 

of Gee proteins and rhodopsins of various species, but less homologous to other unrelated 
sequences (data not shown) 
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TABLE 4 



Two consensus oligonucleotides (G2 and G4) 
among five different cDNAs of G protein a subunits 



10 



15 



Gi-1 

G»-2 

Gi-3 

Gs 

GO 



Consensus sequence (# of mismatch) [SEO ID NO:] 



G2 

AGCACCAT T GTGAAGCAGATGA [700] 



G4 

Length (bp) TGTTTGATGTGGGAGGCCAGAG [701 J 



AGCACaATTGTGAAGCAGATGA (1) (702] 

AGCACCATcGTcAAGCAGATGA (2) (703] 

AGtACtATTGTGAAaCAGATGA (3) (7041 

AGCACCATTGTGAAGCAGATGA (0) (70S] 

AGCACCATTGTGAAGCAGATGA (0) (706) 



476 TGT T TGACGTGGGAGGCCAGAG (1) [805] 

479 TGTTTGATGTGGGtGGtCAGcG (3) [806] 

476 TGTTTGATGTaGGtGGCCAaAG <3) [707] 

524 TGTTcGATGTGGGcGGCCAGcG <3) [708] 

479 TGTTTGAcGTtGGgGGCCAGcG (4) [709] 



TABLE 5 

Nucleotide sequence similarity of PCR products 
among 5 different Ga protein clones. 



20 



Gi-1 



Gi-2 



Gi-3 



Gs 



GO 



25 



30 



35 



Gi-1 
Gi-2 
Gi-3 
Gs 
Go 
(3) 



73.4 
76.8 
50.2 
64.0 



73.4 

67.8 
52.3 
65.9 



76.8 
67.8 

47.6 
62.6 



50.2 
52.3 
47.6 

55.6 



64.0 
65.9 
62.6 
55.6 



PCR of rat cloned G« protein cDNAs 

The cloned rat G M , G;. 2 , G^, G s and G 0 cDNAs were diluted at 10 ng/ul, and 1 ul 
of DNA was mixed with 1 mM each of dATP. dGTP. dCTP and dTTP, 1 ul of each PCR 
primers, I ul of 25 mM MgCl 2 , 5 ul PCR buffer, and 0.5 ul of Taq polymerase as described 
in the Materials and Methods. PCR was then carried out with 30 cycles of annealing 
temperature at 45°C for 1.5 min, 72° extension for 4 min followed by 95°C denaturation 
for 1.5 min, 72° C extension for 4 min followed by 95 C C denaturation for 1.5 min. PCR 
products were then separated in 1.2% agarose gel. and .stained with ethidium bromide 
(upper panel). Marker indicates a 0.6 Kb fragment at Hindlll digested X DNA. DNAs 
were then transferred onto nylon membranes, hybridized with subclass specific biotin-labeled 
probes as described in the Materials and Methods. The membranes were washed with 
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primary wash buffer (0.5x SSPE, 36 w/v% urea. 0.4w/v% SDS) at 65° C. incubated with the 
blocking buffer for at Jeast 3 hours at room temperature, then alkaline phosphatase- 
conjugated streptoavidine (1:5,000 dilution) was added, and incubation was continued for 
an additional 30 min at room temperature. After extensive washing, membranes were 
5 soaked in Lumi-Phos 530 for approximately 1-2 min, then chemiluminescent signals were 

exposed to X-ray film for 10 min to 1 hr (lower 5 panels). 

PCR was first carried out at different annealing temperatures ranging from 37°C to 
65°C using the X gtlO library of human HL-60 cells. As a result, PCR products were seen 
only at 45° C and 55° C with a size of approximately 500 bp (data not shown), which was 

10 similar to theoretical values (576 to 524 bp) (see Table 4 above). Therefore, all the PCR 

was then carried out at an annealing temperature of 45°C 

As shown in Figure 14, cloned rat Go protein cDNAs (G M , G^, G,, 3 , G s , G 0 ) were 
successfully amplified using the same set of PCR primers (G-» and G 4 ) with a size of 
approximately 500 bp in 1.2% agarose gels stained with ethidium bromide. According to 

15 the computer analysis (DNASIS), the nucleotide sequences of the amplified PCR products 

were less homologous among five Got proteins with the percentage of similarity ranging from 
76.8% to 47.6%. This indicates that after PCR amplification, each of the components of Ga 
proteins can be identified by Southern blot analysis, even though the sizes of the PCR 
products generated are very similar among the five Ga proteins. Therefore, another PCR 

20 was carried out in which 35% of the dTTP was replaced with biotin-conjugated dUTP in 

order to prepare subclass-specific, biotin-labeled probes. Southern membranes were then 
probed with these biotin-PCR products. As shown in Figure 14. these biotin-PCR probes 
were highly specific to each Ga protein subclass with washing temperature at 65° C. At low 
stringent washing, these probes cross-hybridized with other subclasses of Ga proteins (data 

25 not shown). 

(4) PCR of a mixture of rat cloned Ga protein cDNAs 

G U1 , Gj. 2 * G i0 . G s and G 0 cDNA were diluted at 10 ng/ul (+ + + ). 100 pg/ul ( + + ) 
and 1 pg/ul ( + ), and various combinations of these diluted cDNA were used as a PCR 
templates (upper panel). Then, PCR and Southern blot were carried out as described in 

30 part (3) of this Example. 

By using the G^ and G 4 sequences, all the subclasses of Ga protein cDNA were 
amplified with PCR when an equal amount of G M , G,_ 2 , Gj_ 3 and G () were present in test 
samples Figure 13, lane 4, 5. 10). However, if all the concentrations of Ga protein cDNA 
are abundant, G n is less amplified (Figure 13. lane 10), probably because the number of 
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mismatches between G 0 and G 4 is higher than others between G 4 and the G 2 sequences. 
If 1 or 2 of the 5 Ga protein cDNAs were present in smaller quantities than the others, the 
amounts of amplified cDNA were relatively correlated with the starting concentrations of 
cDNAs (Figure 13, lane 1, 2, 3, 8, 9). Furthermore, if 1 of 5 of the Ga proteins' cDNA is 
5 more abundant than that of the others, this G protein gene was amplified more than others 

(Figure 13, lane 6, 7). 

(5) PCR of rat cDNAs fr om various tissues 

Rat XZAP cDNA libraries of pituitary (P) and intestine (I), and cDNA from the 
mRNA of rat kidney cells (KNRK) were used as PCR templates. Then, PCR and Southern 
10 blot were carried out as described in part (3) of this Example. 

Using this PCR method, Ga protein genes were amplified not only from cloned 
cDNAs, but also from various rat cDNAs (Figure 15). In XZAP cDNA libraries from rat 
pituitary glands and cDNA from rat kidney KNRK cells, G 0 was more abundant than G s , 
G^ 2 and G^, and G lA was undetectable (Figure 15, lane 1, 2). XZAP cDNA library of rat 
15 intestine contained more G^, G^, and G s and G 0 (Figure 15, lane 3). ^ ■■ 

(6) PCR of human cDNAs 

cDNAs from the mRNA of human IM9 B-lymphocytes and Jurkat T-lymphocytes 
were used as PCR templates. Then, PCR was carried out with annealing temperature at 
45°C (1,3) or 55°C (2, 4). Southern blot were then performed as described in part (3) of 

20 this Example. 

According to sequence analyses, the PCR products of rat G M , G^, Gj_ 3 , G s and G 0 
sequence amplification exhibited a high degree of homology to human G protein cDNAs, 
see Table 6 below). Furthermore, as shown in Figure 16, PCR with a pair of G ? and G 4 
primers could amplify 500 bp DNA from cDNAs of human IM9 and Jurkat cells. Unlike 

25 rat cDNAs (Figure 15), both IM9 and Jurkat cells contained all the subclasses of Ga 

proteins (Figure 16). However, Gj_ 3 is relatively more abundant in IM9 cells, while G s and 
G 0 were more in Jurkat cells than IM9 cells (Figure 16). 
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TABLE 6 



Nucleotide sequence similarity of PCR products between 
rat and human G proteins 



5 Length (bpf^o. of 

Human mismatch % Similarity 

Gi-1 476 476 62 87.0% 

Gi-2 479 479 44 90.8% 

10 Gi "3 476 476 40 91.6% 

Gs 5 24 482 52 89.2% 

Go 479 479 39 91.9% 

It will be evident to one having ordinary skill in the art that a variety of sequences 
could serve as sense or antisense primers for PCR methods or as probes for the detection 
15 of mRNA as described herein. A method for identification of such sequences that are 

either common to a variety of G proteins or specific to a particular species is provided 
hereinbelow. In the preferred embodiment of this method of identification, a computer 
program is used to identify the sequences. Through use of such a program, we have 
identified a large number of both common and specific primers and probes. Provided as 
20 Tables 7 through 13 are various sense sequences identified through the use of such a 

program that are useful as G protein probes and primers. 

All of the sequences listed in these tables are useful within the context of the PCR 
methods of the present invention. The complementary antisense sequences are also useful 
in certain aspects of the invention. As will be known having ordinary skill in the art. for 
25 common probes that are similar, but not identical to target sequences, stringency conditions 

can be varied (e.g. by changes in temperature and salinity) so that such probes will hybridize 
or fail to hybridize with a particular target sequence. Thus, also included within the present 
invention are sequences thar are capable of hybridizing with the same sequences as either 
the sense sequences listed or their anti-sense counterparts. 
30 Additional probes for G protein also include the following: 

Common G protein probes 

5'-CTCTGGCCTCCCACATCAAACA-.V (SEQ ID NO: 139) 
5 ' - TC A TCTG CTTCA C AA TGGTGCT-3' (SEQ ID NO: 140) 
Specific probes (Human & Rat common) 
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5'-GTTTTCACTCTAGTTCTGAGAACATG-3' (SEQ ID NO: 1 
5'-CAAAGTCGATCTGCAGGTTGC-3' (SEQ ID NO: 142) 
5'-ATGGTCAGCCCAGAGCCTCCGG-3' (SEQ ID NO: 143) 
5^GTCTTCACTCTCGTCCGAAGA-3* (SEQ ID NO: 144) 
5 '-G CCTTGG CATGCTCATAG AATT-3 ' (SEQ ID NO: 145) 
5'-TTCATCCTCCCACAGAGCCT^G-3 , (SEQ ID NO: 146) 
5'-CGCATCATGGCAGAAAGCAG-3* (SEQ ID NO: 147) 
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Table 7. G protein common primer (G2-S) 



G2-S 

AGCACCATTGTGAAGCAGATGA 



10 



Human 
Gi-1 
Gi-2 
Gi-3 
Gs 
Go 



HUMGNBPAI 

HUMGIAA 

HUMGIAB 

HUMGNPAS 

HUMGOAQ01 



SEQ ID NO: 
148 
149 
150 
151 
152 



Rat 



15 



20 



25 



Gi-1 

Gi-2 

Gi-3 

Gs 

Go 

Gx 



RATBPGTPB 

RATBPGTPA 

RATBPGTP 

RATBPGTPD 

RATBPGTPC 

RATGXA 



153 
154 
155 
156 
157 
158 



Highest matched sequences in GenBank 
HUMADECYC adenyl cyclase 159 : : 
RATACOAlacyl-coA oxidase 160 GC 
HUMTGASEtransglutaminase 161 CT 



CC: AC 
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Table 8. G protein common primer (G4-AS) 



SEQUENCE ID NO; 



G4-AS 

TGTTTGATGTGGGAGGCCAGAG 



10 



15 



20 



25 



Human 

Gi-l HUMGNBPA 162 :::::::::::::::: T: ::: : 

Gi-2 HUMGIAA 163 ::::::::::::: T :: T ::: C: 

Gi-3 HUMGIAB 164 :::::::::: A : : T : : : : :A: : 

Gs HUMGNPAS 165 :::::::C:::::T::::::C: 

Go HUMGOAQOl 166 ::::::: C :: C ::::::::: C : 

Rat 

Gi-l RATBPGTPB 167 ::::::: C ::::::::::::: : 

Gi-2 RATBPGTPA 168 ::::::::::::: T :: T ::: C : 

Gi-3 RATBPGTP 169 :::::::::: A :: T ::::: A : : 

Gs RATBPGTPD 170 :::: C :::::::: C :::::: C : 

Go RATBPGTPC 171 ::::::: C :: T :: G :::::: C : 

Gx RATGXA 172 :: G : G ::::::::::: G :::: : 

Highest matched sequences in GenBank 

HUMLDLRRL LDL-receptor 17 3 CTGA: :::::::::::: : TACT 

MUSHEPGFA hepatocyte growth 174 GAG : G ::::::: T :: A ::::: : 

HUMKEREP epidermal keratin 17 5 AG: : : : : : AT: : :GG: : : : : 
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10 



15 



20 



laDie 


Q 


Gi — 1 protein 


speci f ic , 


human 


-rat common 








primer (Gi- 


1) 






















Gi-1 








SEQUENCE ID 


NO: 


G ATGTT CTC AG AA CT AG AGTG AAAA C 


Human 


















1 


TIT TU/* 1 XT T~> T""l •» ~T" 

HUMGNBPAI 


176 








• • • • • 


bl- 


z 


TTT TVff/"'* T * » 

HUMGIAA 


177 


TT 




::::::: CT : CCCCTGTCCCCT 




J 


HUMGIAB 


178 






: : : :TC:G: :G: : : : : : 


: :G: : 






HUMGNPAS 


179 


AC 




:G:C: : :T: : : TCCTG : 


:C: :G 








T Q f\ 


: ;CA: 


:C: : :C: : : :C: :G: :C 




Rat 
















Gi- 


1 


RATBPGTPB 


181 






* ••••••••*••«••• 




Gi- 


2 


RATBPGTPA 


182 






G : : GC : G : ; CC : T : : : 


: :G: : 


Gi- 


3 


RATBPGTP 


183 






: : : TC:G: :G: : : : : : ; 


: :G: : 


Gs 




RATBPGTPD 


184 


A : GCACAATTA : TTA 


: : :CG 


Go 




RATBPGTPC 


185 


: :CA: 


* • • O • • » * /"** • * (~* • • /"* 4 

• « ■ >-» • • • • • « Vj • « V- , 




Gx 




RATGXA 


186 


TCA: : 


: : GAG : : C ; : : A : CC : : 


: :CA 
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Table 10. Gi-2 protein specific, human-rat common 
primer (Gi-2) 



10 



15 



20 



SEQUENCE ID NO: 



Gi-2 

GCAACCTGCAGATCGACTTTG 



Human 












Gi-i 


HUMGNBPAI 


187 


:G: GGT: : A: 




: A 


Gi-2 


HUMGIAA 


188 








Gi-3 


HUMGIAB 


189 


: ACGG : : AA : 




:T 


Gs 


HUMGNPAS 


190 


AGT : : : A : : T 




:T 


Go 


HUMGOAQ01 


191 


CTTCT: : : : : 


: :G: r 


Rat 












Gi-l 


RATBPGTPB 


192 


: G : GAT : : A: A: : 




Gi-2 


RATBPGTPA 


193 


•••**••»•., 

••••••••••4 






Gi-3 


RATBPGTP 


194 


AA: : : G : : : : : 


T: 


T 


GS 


RATBPGTPD 


195 


AGT: : : A: : T : 






Go 


RATBPGTPC 


196 


ATTCT: : : : : : 


: A:: 


Gx 


RATGXA 


197 


: T : : : A : C : : : 


T: 


T 



TT : 



dTOTUTH SHU^T 
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Table 11. Gi-3 protein specific, human-rat common 
primer (Gi-3) 

Gi-3 

SEQUENCE ID NO: TCTTCGGACGAGAGTGAAGAC 



Human 












Gi- 


1 


HUMGNBPAI 


198 


• • • A • A • * T* • « 

• • • • £\ * • X. * * 


:::::: A 


Gi- 


2 


HUMGIAA 


199 


G : : A: : : : : CC : 


C: : A: : : 


Gi- 


3 


HUMGIAB 


200 


•••••••••••i 




Gs 




HUMGNPAS 


201 


GGGAAATCGA : : 


:T: : :G: 


Go 




HUMGOAQ01 


202 


• • • C • ■ • • GA * 


: CGTG : A 



15 Rat 



Gi- 


1 


RATBPGTPB 


203 


: : :CA 


:A: :T: : : : 






: : A: : 


Gi- 


2 


RATBPGTPA 


204 


G: :G: 


: : : : CC : T: 








Gi- 


3 


RATBPGTP 


205 












Gs 




RATBPGTPD 


206 


TTCCT * 








* T : TG 


Go 




RATBPGTPC 


207 


CGCAT : 


: : G : : C : C : 






: CCA 


Gx 




RATGXA 


208 


AAC: : : 


:G: A: : : :CACCAT: 
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Table 12. Gs Protein specific, human-rat common 
primer (Gs) 

5 Gs 

SEQUENCE ID NO: AATTCTATGAGCATGCCAAGGC 



Human 

Gi-1 HUMGNBPAI 209 GGGCGG : : : : T : : : C : : : : : CT 

10 Gi-2 HUMGIAA 210 : : : ATG : : : : : GCA : : : : GCTA 

Gi-3 HUMGIAB 211 C : G : : : : ACTA : : : T : : : : CTC 

Gs HUMGNPAS 212 :::::::::::::::::::::: 

15 



SUBSTOTUTE SHE! 1ST 



WO 93/15221 



-53- 



PCT/US93/00977 



CO > 

o — 



* 

o 



o 

w 

c 
c 



8. 



a o 

CO w 



^> < 
- o 



< 

f\J »— 

CO o 

• < 

to <C 

o 
o 



IT* <* 

o 5 



c 

I? 



oj oj oj oj oj 



CO O AJ sT >0 
^ N N N N 

oj oj rvi (\i rvj 



r- co 
oj oj 



oo o 
is. oo 
rvi oj 



CO CO 

oj f\> 



co co 

oj oj 



• - <c o 

o o • o 

(J • « O 

U U ' U 



o o 
o o 

»— CJ 



«* . < 

o < 



CO CO 







O 




























o 




o 






















































<x 
























o 




o 
























LJ 


LJ 




















«*: 






CD 


<c 




u 






3 














a. 


a. 




Cl 


o. 






X 






at 






















< 






OJ 




OJ 


X 






o 


CJ 


o 












«t 




Cl 




Q_ 


a. 


o. 


a. 


Cl 




CO 






CO 


CO 




CD 




CD 


CD 


CO 


CO 


CD 




o 


o 




O 






X 




►— 












CO 


CO 




X 


X 




X 






*t 


*t 


«t 


< 






3 




3 


3 




3 


< 




DC 


cc 


ae 






X 


X 






X 


« 




























* 










OJ 


r*"> 








OJ 






OJ 


C 
















<U 






c 






<0 














o 


oo 








</) 


CO 


CD 






O 


(J 




O 


o 


3 


o 


o 




U 


o 


C 
















o 












OJ 
















x 






2: 






o 







£ CO 

>o 



CO —•• 

CO 

O C 

o 

OJ — 



i §■« 

CO 

U Q 

? ii S 



O CO 



no; 
•— J= 



O 
CM 



SUBSTITUTE SHEET 



WO 93/15221 



PCT/US93/00977 



-54- 

Table 13. Go Protein specific, human-rat common 
primer (Go) 



Go 

SEQUENCE ID NO: CTGCTTTCTGCCATGATGCG 
Human ~~ ~~ — 



10 



Gi- 


1 


HUMGNBPAI 


213 


GAG: 




: TAA : : : TGACA 


Gi- 


2 


HUMGIAA 


214 




: TGA: : 




Gi- 


3 


HUMGIAB 


215 




• • • • • 


: AGA : CTTCACA 


Gs 




HUMGNPAS 


216 


TCAACGA : 


: : : : G : : : CATC 


Go 




HUMGOAQ01 


217 










Gi- 


1 


RATBPGTPB 


218 




: A 




CCA : G : T : : : TT 


Gi- 


2 


RATBPGTPA 


219 


T: 




:GA: 


CC : : : GCG : : : : 


Gi- 


3 


RATBPGTP 


220 










Gs 




RATBPGTPD 


221 






:G : 


: : AG : A : G : C : T 


Go 




RATBPGTPC 


222 










Gx 




RATGXA 


223 


: A: 




C: T 


: : : : C : : : CTG : 
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Substance P 

Another example of clinically useful measurement of mRNA would be for the 
mRNA of substance P. Substance P is a neurotransmitter. Because the nerves expressing 
substance P are involved in pain receptor pathways, the ability to measure the mRNA of 
5 substance P would be useful in the development of analgesics. 

EXAMPLE 6 

Detection of Human Substance P Receptor mRNA From 
Substance P Receptor cDNA-Transferred Cells. Vector- 
Transfected Cells, and Evaluation of Their Reliability 

10 (1) Preparation of the First Nucleotide Probe 

The first nucleotide probe used for quantification of substance P receptor-specific 
mRNA was an oligodeoxyribonucleotides containing a 5' amino group. This probe was 
manufactured using a DNA synthesizer (model 380B. Applied Biosystems, CA) and is shown 
below: 

15 5'-NH r GGA CTT ATG AGA AAG CGT ACC A-3' (Seq ID NO:5). 

(2) Immobilization of t he Second Nucleotide Probe Onto Insoluble Supports 

The first nucleotide probe were dissolved in DEPC-treated water at a concentration 
of 1 ng/jil, ^en mixed with the EDC/Sulfo-NHS solution described in Example 1 Paragraph 

(2) in a ratio of 1:25 (Voi:Vol). 50 ^1 of the above mixed solution was added to each well 
20 of a Sumilon microtiter plate (MS-3796F, Sumitomo Bakelite, JAPAN) and the plates were 

incubated overnight at room temperature. Following incubation, the reaction solution was 
removed with an aspirator. 

(3) Preparation of Standard mRNA 

A plasmid vector pKR2 containing human substance P receptor cDNA (provided by 
25 Dr. S. Nakanishi, Kyoto University, Japan) was linearized by Smal (Promega. WI) digestion. 

Substance P receptor-specific mRNA was prepared by the method described in Example 1 
Paragraph (3) using T7 RNA polymerase (Promega. WI). Concentrations of the resultant 
mRNA were determined by spectrophotometry and then diluted with DEPC-treated water 
to prepare standard mRNA solutions as described in Example 1 Paragraph (3). 
30 (4) Preparation of Cell Lvsates 

One line of human T lymphocyte-derived Jurkat cells which had been transfected 
with substance P receptor cDNA was examined tor transcription of substance P receptor 
mRNA (hereafter called JSP, provided by Dr D G. Payan, University of California. San 
Francisco. CA). Another line of Jurkat cells which had been transfected with a vector that 
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didn't express substance P receptor (hereafter caUed JVec. a kind gift of Dr. D.G. Payan, 
ibid) was used as a negative control. Cells were collected by centrifugation. washed with 
PBS, and resuspended in lysis buffer at 1 x 10 7 celJs/ml as described in Example 2. The 
cells were sheared by repeated passage through a 21 gauge needle, then incubated in a slow- 
5 shaking waterbath at 45° C for 1 hour. 

< s > Hybridization of the First Nucleotide Probe to mRNA 

In order to remove possible RNase contamination in the microtiter wells containing 
the bound first nucleotide probe, 250 ul of lysis buffer was added and incubated at 45° C for 
1 hour as described in Example 1. The buffer aspirated off, and 50 ul of lysis buffer 
containing 0.5 M NaCi and various concentrations of standard human substance P receptor- 
specific mRNA was added into each well, and incubated at 50°C for 1 hour to allow 
hybridization as described in Example 1. 

(6) Hybridization of t he Labeled Second N ucleotide Prnhe to a Pnlvf Al Tail of mRNA. 
and Measurement of Chemical Activities of the l.ahe|ed Second Nucleotide Pmhe 

15 After hybridization of the first nucleotide probe to bound mRNA as described in 

Paragraph (5) above, the hybridization solution was removed by aspiration and individual 
weUs were washed once with 250 ul of lysis buffer. Five ul of lysis buffer containing 0.5 M 
NaCI and 1 ul of the biotinylated second nucleotide probe solution (35 pmol/ui) was added 
into each well and incubated at room temperature for an additional 1 hour to allow the 

20 second hybridization as described in Example 1. 

Chemical activities of the labeled second nucleotide probe was also measured as 
described in Example 1. 
(?) Preparation of Calibration Line 

The calibration line of standard substance P receptor-specific mRNA is shown in Fig. 

25 9. 

(S) Evaluation of Reproducibility 

Concentrations of substance P receptor-specific mRNA in JSP and JVec cell lysates 
were measured twice. Table 7 shows concentrations of mRNA obtained from the calibration 
line as described in Paragraph (7). Cell numbers from each sample were counted by a 
phase hemacytometer (American Scientific Products. IL). As shown in Table 14, substance 
P receptor-specific mRNA concentrations were found in JSP cells that had been transfected 
with substance P receptor cDNA, but were not detected from JVec cells transfected with 
only a control vector. This shows that the invented method provides good quantification of 
a specific mRNA with high reliability. 
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TABLE 14 



Concentrations of Substance P Recentor-Specific m RNA in b.rkat Cells Transfected with 
Substance P R eceptor cDNA or Only a Control Vector. 



5 

No. of Assay mRNA Concentration 



JSP [pg/celi] 
(Transfected) 



135 

62.4 



JVec (fg/ceU] 
(Control) 



<0.1 

<0.1 



Oligonucleotide Primers and Prob es for the Detection or lun Oncogenes 

15 ° ne aspect of the present invention relates to DNA primers and probes for the 

detection of jun oncogenes, such as by PCR. Expression of the jun gene can serve as a 
marker for cell growth. 

It is well known that certain oncogenes, such as jun or fos are most rapidly expressed 
when cells are stimulated with outside stimuli to proliferate. Therefore, detection or 
20 quantification of levels of expression of jun or fos oncogenes wi!J be a good marker for 

ceDular autogenic activity. 

Jun oncogenes are one of oncogenes first reported by Maki. et al. (Proc. Natl. Acad. 
Sci. USA. 84:2848-2852. 1987). and current intensive molecular cloning identified at least 
three subtypes of jun oncogenes, such as jun-B. c-jun and jun-D. Based on nucleotide 
sequence comparison, three subtypes of jun oncogenes in both humans and mice are 
different (Ryder K, Nathans D. Proc. Natl. Acad. Sci. USA 85:8464-8467, 1988; Ryder K. 
et al.. Proc. Natl. Acad. Sci.. USA 85:1487-1491. 1988; Hattori K. et al., Proc. Natl. Acad. 
Sci. USA 85:9148-9152. 1988; Schuette J. et al.. Cell. 59:987-997. 1989). However, because 
it is still unclear how these three different jun oncogenes are involved in cell growth, one 
30 has to analyze all of the three genes during cell growth. 

Northern blot analysis is widely accepted in order to detect specific genes (Sambrook 
J. et al.. Molecular cloning. A laboratory manual. 2nd eds. pp. 7.39-7.52). and some 
investigators have already adapted this method to detect jun oncogenes (Sherman, et al.. 
Proc. Natl. Acad. Sci. USA. 87:5663-5666. 1990; Oursler MJ et al., Proc. Natl. Acad. Sci. 
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USA, 88:6613-6617, 1991). In this method, mRNA is first purified from tissues/cell of 
researchers' interest, separated on agarose gel electrophoresis. After electrophoresis, 
mRNA is transferred onto membranes, and hybridized with radioactive probes to identify 
positive band(s) on autoradiography. However, this method is not so sensitive that it is 
5 difficult to identify specific signals if materials have only smaU quantity of genes. 

Alternatively, because reverse PCR ("Molecular Cloning") is also used to detect a wide 
variety of genes from different tissues/cells, this technique is also applicable for the 
detection of jun oncogene. In this method, mRNA is first convert to cDNA by reverse 
transcriptase, then specific gene fragments are amplified by PCR by using one set of primers 
10 (sense and anti-sense primers). Amplified gene can be seen on agarose gel electrophoresis 

in the prepense of ethidium bromide. This method is much more sensitive than the 
conventional Northern blot analysis, but in order to detect three different subtypes of jun 
oncogenes, one has to prepare six primers. Also, because of different nucleotide sequences 
between humans and mice, one has to prepare human and mice primers separately. 
15 ° ne g° al w « accomplished with the present invention was the design of 

oligonucleotide sequences for both sense and anti-sense primers that could be readily used 
for detecting the minuscule amounts of the jun gene found in vivo. We also wanted these 
primers to be usefuJ for detecting both human and murine jun sequences. Thus, we selected 
primers which fulfilled the below conditions: 
20 ( a ) A common nucleotide sequence among three subtypes of jun oncogenes (jun- 

B, c-jun and jun-D), with maximum 4 base mismatches among them; 

(b) A common nucleotide sequence among humans and mice, with maximum 4 
base mismatches between them; 

(c) At least 5 bases from 3' end are 100% identical without any mismatches 
25 among three subtypes and between humans and mice; 

(d) A nucleotide sequence length in both sense primer and anti-sense primer 
from 17 to 50; 

(e) A difference of T n? between sense primer and anti-sense primer is within 2 
degree C; 

30 (0 Lack of complementary structure with more than 4 bases long in either the 

sense or the anti-sense primer; 
(g) Lack of complementary structure with more than 4 bases long between the 
sense and the anti-sense primer; 
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(h) Both the sense and the anti-sense primer exist in the area of coding 
sequence; 

(i) The length of DNA to be amplified is greater than 200 bases; and 

0) The nucleotide sequence homology of amplified genes among three subtypes 
5 of jun oncogenes is not greater than 80%, and after gene amplification by 

the PCR, one can characterize each subtype of jun ongenes by Southern 
Blot. 

DNA fragments (both sense and anti-sense primers) which satisfy the conditions as 
shown above were investigated and the several candidate oligonucleotides were synthesized. 
10 The test results showed the sense primer 5'-CCCTGAAGGAGGAGCCGCAGAC-3' and 

anti-sense primer 5'-CGTGGGTCAAGACTCTGCTTGAGCTG-3' were desirable 
candidates. 

Table 15 shows the nucleotide sequence homology between a preferred sense primer 
and three subtypes of jun oncogenes in both humans and mice. 
15 TABLE 15 



20 



25 



Sense primer 
5 1 -CCCTGAAGGAGGAGCCGCAGAC-3 1 (SEQ ID NO:310) 



Mouse jun-B T-T A (SEQ ID NO: 311) 

c-jun A (SEQ ID NO: 312) 

Human jun-B — T-C A (SEQ ID NO: 313) 

c-jun T (SEQ ID NO : 3 14 ) 

indicates identical base to the sense primer. 

In addition, Table 16 shows the nucleotide sequence homology between the sense 
primers and three subtypes of jun oncogenes in both humans and mice. 
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TABLE 16 

Antisense primer 
3 • -GTCGAGTTCGTCTTTCAG AACTGGGTGC- 5 • (SEQ ID NO: 315) 



35 



Mouse jun-B C A- (SEQ ID NO: 316) 

c-;jun T (SEQ ID NO: 317) 

}un-D T G-G-C (SEQ ID NO: 318) 

10 Human J un-B T C (SEQ ID NO: 319) 

c-jun T (SEQ ID NO: 320) 

-: indicates identical base to the sense primer. 

Also, Table 17 shows nucleotide sequence homology of amplified genes between each 
15 of subtypes of jun oncogenes. 

TABLE 17 



20 


Subtypes 


Homology (%) 


jun-B & c-jun 


72.0 




jun-B & jun-D 


73.6 




c-jun & jun-D 


75.3 


25 


Table 18 shows the 


homology between human and mouse subtypes. 






TABLE 18 


30 


Subtypes 


Homology between human & mouse (%) 


jun-B 


93.0 




c-jun 


91.9 



The primer sequences can be modified in a variety of ways. For example, a primary 
amine residue, nucleotide sequence recognized by a restriction enzyme, or RNA promoter 
sequences for further modification after PCR amplification, can be added to the 5' end of 
the primers. These modifications are represented herein by the designation "X". Thus, for 
example, a modified primer as shown in Table 16. can be represented as 
5'-XCGTGGGTCAAGACTTCTGCTTGAGCTG-3' can also be used, in which X represents 
the primary amine residue, nucleotide sequence recognized by the restriction enzyme, or 
40 RNA promoter sequences for further modification after PCR amplification. 

The attachment of a restriction site at the 5* end (X) is useful for cloning of 
amplified genes. The attachment of RNA promoter sequences at 5" end (X) is useful in 
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RNA transcription and RNA transcription-based amplification. The attachment of primary 
amine at the 5* end is useful for coupling reaction to labeling compounds or solid supports, 
so that the amplified gene can be easily quantified. 

For the above-mentioned nucleotide sequence (X) which undergoes cutting by a 
restriction enzyme, instead of those nucleotide sequences with higher frequency such as Eco 
RI or Bam HI, it is often desirable to use longer recognition sequences (e.g. 8 bases) which 
appear in lower frequency such as the nucleotide sequence. 5-GCGGCCGC-3', which is 
recognized by NotI or the nucleotide sequence. 5*-TTAATTAA-3\ which is recognized by 
Pad. The reason for this is the relatively common nucleotide sequences recognized by 
EcoRI or BamHI can result in false positive reaction during PCR amplification because of 
priming of primers to undesirable DNA sequences. 

The above-mentioned nucleotide sequence (X) containing an RNA promoter can be 
any of a variety ot such sequences, such as the T7. SPf> or T3 RNA promoter sequences. 

The above-mentioned primary amine residues ( X) can be added at the 5' end during 
oligonucleotide synthesis. 

The DNA fragments for the sense primer and anti-sense primer in this invention can 
be easily synthesized in DNA synthesizer which are available in the market. These 
synthesized oligonucleotides can be purified by high pressure liquid chromatography or gel 
electrophoresis. 

The test material to be analyzed is usually a total RNA or purified mRNA from cells 
or tissues. If desired, cells or tissues can be tested in their natural state without any 
pretreatment; however, in the case of drug testing, this can be accomplished by reacting the 
drug with cells or tissues in the test tube, or the drug can be administered (intravenous 
injection, subcutaneous injection, intramuscular injection, oral administration, intra- 
abdominal injection) to a laboratory animal, and after a set time period, cells or tissues can 
be removed for RNA/mRNA purification. 

Methods of purification of total RNA or mRNA are well known in the art. 
Examples of such methods are the standard protocol described in Sambrook. et al., 
"Molecular Cloning, A Laboratory Manual, 2nd Ed.." Cold Spring Harbor Press. Cold Spring 
Harbor, NY ( 1989). and a commercially available kit such as FastTrack from Inviirogen (San 
Diego, CA). In either case, in order to avoid introducing RNase. hands should preferably 
be protected with vinyl gloves, and the instruments used for experiments not be touched 
with hare hands. Also, any glass containers to be used in the experiments should be heated 
prior to use at approximately 250 degrees C for at least 4 hours Furthermore. 0. 1 °'r. diethyl 
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pyrocarbonate (hereafter abbreviated DEPC) is added to the water which is to be used, and 
after incubation at 37 degrees C overnight, treated water is autoclaved. Other basic 
procedures are undertaken as outlined in the "Molecular Cloning," pp. 7.3-7.5. Vanadyl 
ribonucleoside complex can also be added to samples as described above. 
5 The method of synthesis of cDNA from m RNA template using reverse transcriptase 

is described in the "Molecular Cloning," pp. 8.11-8.13. 

For PCR amplification, synthesized cDNA is mixed with sense primer, antisense 
primer, 4 types of deoxynucleotides (dATP, dCTP, dGTP and dTTP). Taq polymerase, 
inorganic salts, and other necessary materials, and PCR reaction is undertaken in the 
10 Thermal cycler (Perkin-Elmar Cetus). A typical PCR method is described in "Molecular 

Cloning," pp. 14.2-14.33). 

In order to analyze the amplified gene, it is appropriate to use electrophoresis. 
After the amplified gene undergoes electrophoresis in agarose gel. the DNA is stained with 
ethidium bromide. The amplified DNA band will be then visible under fluorescent light as 
15 described in "Molecular Cloning." pp. E.3-E.4. After taking photographs of the DNA band, 

it is also possible to quantify the intensity of each bands by scanning photographs and 
analysis of scanned picture by commercially available system, such asStratascan (Stratagene. 
La JoUa). 

Furthermore, after agarose gel electrophoresis, amplified genes can be transblotted 
20 onto membranes, and subtypes specific genes can be detected by hybridizing with labeled 

probes followed by exposing labeled signals, such as 32P or chemiluminescence to either 
Polaroid films or X-ray films (Southern blot). 

EXAMPLE 7 

Synthesis of Sense and Anti-Sense Primers and 
25 Amplific ation of Mouse Clones of Jun Oncogenes 

The sense primer SEQ ID NO:9 (s943-2) and the anti-sense oligonucleotide SEQ 

ID NO: 10 (AS1 132-2) were synthesized by a synthesizer 380 B type (Applied Biosystems 

Co.). After treatment with ammonium hydroxide at 55°C overnight, synthesized 

oligonucleotides were dryed in a Speed-Vac (Savant Co.), and the concentration was 

adjusted at 1 microgram/ml with water, and was stored at -20 degrees C until use. 

1 microliter of one of the three types of mouse jun clones (jun-B, c-jun, or jun-D, 

obtained from ATCC). 1 microliter of sense primer, I microliter of anti-sense primer. 5 

microliters of 10X buffer for PCR (Promega). I microliter of 25 mM magnesium chloride, 

4 microliters of 10 mM dNTP mix, and 0.5 microliters of Tac] polymerase (Promega) were 
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missed, and water was added up to a total volume of 50 microliters. After adding of two 
drops of mineral oil, PCR was undertaken using the thermal cycler, model 480. After 
reaction mixture was heated at 95 degree C for 10 min, PCR was carried out with the 
following cycles 30 times: annealing at 55 degrees C for 1.5 minutes, extension at 72 degrees 
5 C for 4 minutes, and denaturing at 95 degrees C for 1.5 minutes. 

After PCR, 10 microliters of the sample were mixed with 1 microliter of lOx loading 
buffer (0.25% bromophenol blue, 0.25% xyienecyanol FF, and 15% Ficoll, Type 400). and 
electrophoresis was carried out in 1.5% agarose gel containing 5 microgram/ml ethidium 
bromide. After electrophoresis, the amplified DNA bands were visualized by an ultraviolet 
10 light. 

As a result, it was found that in each of the cases where the clones jun-B, c-jun, and 
jun-D were used, a single band for the amplified DNa was observed at the position of 
approximately 270 bp. This indicates that the above-mentioned set of primers can recognize 
and amplify ail three types of mouse jun oncogenes, such as jun-B, c-jun, jun-D. 

15 (1) Alternative Synthesis of PCR primers for amplification of iun gene specific mRNA 

Sense (jun-s) and antisense (jun-as) oligodeoxyribonucleotides specific for jun 
oncogenes were synthesized in a DNA synthesizer as described above. The nucleotide 
sequences were as follows: 

jun-s S'-CCCTGAAGGAAGAGCCGCAGAC-T (Seq ID NO: 11) 

20 jun-as 5'-CGTGGGTCATGACTTTCTGCTTGAGCTG-3 1 (Seq ID No: 12) 

(2) PCR amplification of cDNA 

To 1 [i\ of ss-cDNA as described in paragraph (IV) t 0. 1 ( 1 pi) each of jun-s and 
jun-as oligonucleotides, 5 ^1 of 10X PCR buffer (Promega), 1 \x\ of 25 mM MgCU, 4 \il of 
10 mM dNTP mixture (Promega), 0.5 pi Taq polymerase (Promega), and 36.5 \x\ of DEPC- 

25 water were mixed. Two drops of mineral oil were overlayered onto the reaction mixture to 

prevent evaporation. The reaction mixture was first heated at 95°C for 10 minutes, then 
PCR was carried out in a Thermal cycler (Model 480. Perkin-Elmer Cetus) with 30 cycles 
of annealing at 55°C for 1.5 minutes, extension at 72°C for 4 minutes and denaturing at 
95° C for 1.5 minutes as described above. 

30 Ten p. I of the resultant PCR products were then mixed with 1 \i\ of 10X loading 

buffer, and applied to a 1.5% agarose gel containing 5 [ic/ml ethidium bromide. 
Electrophoresis was carried out in a IX TBE buffer at 100 V/fo cm width for 
approximately i hour. After electrophoresis, fluorescent DNA bands were recorded onto 
Polaroid film. Lanes 1-8 of Figure 17 indicate duplicate experiments of a 1:1000 dilution. 
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full strength (equivalent to i ml of blood), 1:10, and 1:100 dilution respectively of a 
leukocyte lysate. As shown in Figure 17, the jun oncogene was amplified from the 1:10 and 
1:100 dilution displaying an estimated size of 370 base pairs. This size is similar to the 
372bp theoretical value. 
5 EXAMPLE 8 

The effect of nretreatm ent with EGF on expression of 
jun oncogenes i n human mononuclear leukocytes 

(1) Pretreatment of human leukocytes with KQF 

40 ml of phosphate buffered saline (PBS) is added to 20 ml of heparinized human 
10 blood and mixed. 10 ml each of this sample is overiayered onto 3 ml of IsoLymph, then 

centrifuged for 30 minutes at 400 x g. After washing the pellet three times with PBS, the 
pellet is resuspended in 3 ml of PBS. 1 ml each of this sample is then placed in three tubes 
(No. 1 to No. 3). 1 ml of PBS Is placed in the fourth tube as control. The total of four 
tubes are incubated for 10 minutes at 37 degrees C, then EGF (Epidermal Growth Factor) 
15 is placed in tube No. 1 at a final concentration of 30 ng/ml. After 15 minutes, EGF is 

placed in the same manner in tube No. 2, and incubated for another 5 minutes. 'After 5 
min, RNA is extracted from all 4 tubes simultaneously. Thus, the time period W the 
pretreatment of human leukocytes by EGF is 20 minutes for tube No. 1, 5 minutes for tube 
No. 2, 0 minutes for tube No. 3. 
20 (2) Extraction of RNA from cetk 

The above four tubes are taken out and undergo centrifugation by microfuge for 10 
seconds. Then, the supernatant is discarded and the below described lysis buffer is added 
to the pellet, and after thorough mixing, this is incubated for 30 minutes at 45 deizrees C. 
Contents of lysis buffer: 
25 10 mM EDTA pH 8.0 

0.5% SDS (bacteria removed by non-bacterial filter) 
0.2 M NaCl 
DEPC treated water 
RNA inhibitor 500 unites/ml 
30 Vanadyl Complex 10 mM 

Proteinase K 200 microgram/ml 
(3) Purification of mRNA 

After 5 M NaCl is added to the above cell lysates to obtain a final concentration of 
0.5 M> oligo (dT) cellulose (Stratagene Co.) is added, and undergoes a reaction for 30 
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mmutes at room temperature. Then, after washing the cellulose in 10 ml binding buffer (20 
mM Tris-HCl, pH 7.6, 1 mM EDTA, 0.5 M NaCl) 5 times. 0.35 ml of DEPC treated water 
is added, and mRNA is eluted from the solid phase. Then, 53 microliters of 2 M sodium 
acetate and 2.5 times the volume of ethanol are added, and after cooling in a dry ice for 20 
5 minutes, this undergoes centrifugation at 15,000 rpm for 20 minutes. After washing the 

pellet in 75% ethanol one time, this is dried, and then dissolved in 10 microliters of DEPC 
treated water. 

(4) Synthesis of cDNA 
50 mM Tris-HCl (pH 8.3). 75 mM KCI, 3 mM magnesium chloride, 10 mM DTT, 

0.5 mM dNTP (dATP, dCTP. dGTP, dTTP), 50 micrograms/ml oligo (dT) primer, and 
10.000 units/ml reverse transcriptase are added to 10 microliters of mRNA obtained from 
above to a total volume of 20 microliters, and this undergoes a reaction for one hour at 37 
degrees C. After the reaction. 20 microliters of a phenol:chloroform:isoamyl alcohol mixture 
is added, and cooled for 20 minutes in a dry ice to precipitate the cDNA. After 
centrifugation for 20 minutes at 10.000 rpm. the peUet is washed one time in 75% ethanol. 
Then, after drying, this is dissolved in 20 microliters of autoclaved water, and stored at -20 
degrees C. 

(5) PCR 

1 microliter each of sense primer and anti-sense primer (1 mg/ml) for jun gene 
20 amplification are added with 2 microliters of cDNA. After mixing this with 50 mM KCI. 10 

mM Tris-HCl (pH 8.4). 2.0 mM magnesium chloride, 100 micrograms/ml gelatin, and 0.2 
mM dNTP. 2.5 units of Taq polymerase are added (final volume is 50 ml). After reaction 
mixture was heated at 95 degree C for 10 minutes. PCR was carried out with the following 
cycles 30 times: annealing at 55 degrees C for 1.5 minutes, extension at 72 degrees C for 4 
25 minutes, and denaturing at <J5 degrees C for 1.5 minutes. 

(*») Agarose gel electrophoresis 

After completing PCR, 10 microliters of the reacted solution is taken and undergoes 
electrophoresis in the same method as described in Example 7. The results showed that in 
the leukocytes which did not undergo treatment with EGF (i.e.. sample which underwent 0 
treatment time in tube No. 3) was found to have a minimal band for the amplified DNA at 
the position of about 270 bp size. However, it was found that the band of amplified DNA 
was increased after 5 min. then returned to the basal levels within 20 min. 

EXAMPLE 9 

Effect of pretreatment wi th PHA on jun gene expression in human leukocytes 
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Fn the place of EGF, PHA (at a final concentration of 10 micrograms/ml) was 
utilized and the pretreatment times were set at 0 minutes. 5 minutes, 15 minutes, and 30 
minutes. Other procedures were identical to those of Example 8. As a result, with the 
pretreatment with PHA done at 15 minutes, it was found that the band for the amplified 
DNA at the position of approximately 270 bp was maximized, but (the time periods) 
afterwards, this band decreased. 

As with the other mRNAs that can be analyzed in accordance with the methods of 
the present invention, a variety of jun sequences could serve as sense or antisense primers 
for PCR methods or as probes for the detection of mRNA as described herein. A method 
for identification of such sequences that are either common to a variety of jun mRNAs or 
specific to a particular species is provided hereinbelow. In the preferred embodiment of this 
method of identification, a computer program is used to identify the sequences. Through 
use of such a program, we have identified a large number of both common and specific 
primers and probes. Provided as Tables 19 through 24 are various sense sequences 
identified through the use of such a program that are useful as jun gene probes and primers. 



All of the sequences listed in these tables are useful within the context of the PCR 
methods of the present invention. The complementary antisense sequences are also useful 
in certain aspects of the invention. As will be known by those having ordinary skill in the 
art. for common probes that are similar, but not identical to target sequences, stringency 
conditions can be varied (e.g. by changes in temperature and salinity) so that such probes 
will hybridize or fail to hybridize with a particular target sequence. Thus, also included 
within the present invention are sequences thar are capable of hybridizing with the same 
sequences as either the sense sequences listed or their anti-sense counterparts. Additional 
25 probes for jun genes include the following: 

Common iun gene probes 

5'-CCATGTCGATGGGGGACAGCGG-3 (SEQ ID NO: 13) 

5*-CTGTTTAAGCTGCGCCACCTG-3 (SEQ ID NO: 14) 

5'-GTCTGCGGCTCCTCCTTCAGGG-3'(SEO ID NO: 15) 

5'-CGTGGGTCAAGACTTTCTGCTTGAGCTG-3- (SEQ ID NO: 16) 
Specific probes 

B type: 5-CACTTGGTGGCCGCCAG-.r (SEQ ID NO: 17) 

C type: 5 -GAGCATGTTGGCCGTGG-.V (SEQ ID NO: 18) 

Human D type: 5 '-G A TG CG CTCCTG CG TG T- 3 ' (SEQ ID NO. 19) 
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Mouse D type: 5 -GCCTGTTCTGGC 1 1 1 1 GAGGG-.T (SEQ ID NO: 20) 
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Table 21. Jun-B specific probe (B-1256). 

GenBank 
81258(5' -3') 

CTGGCGCCCACCAAGTG 

Hunan " ' 

B: HUMJUNCA /ccn tJrt _ 

HUMJUNA A..C-T-T 55 £ NO' « 

B: MUSJUNBA .„ un 

r . MI , CIMUr <SEQ 10 NO: 65) 

Mulrjnu A-T--C — T (SEQ 10 NO: 66) 

A-T--C---T (SEO 10 NO: 67) 

MUSJUN A-T--C---T (SEQ in no- API 

0: MUSJUND c rcr r Air, m u« l£ } 

MUSJUNDA T- — -GCCA All n ^ 

MUSJUNDR I CC (SE ° 10 N0: 70 > 

Rat MUSJUN0R T GCCA- ( SEQ 10 NO: 71) 

C: RATRJG9 T GCCAA (SEO ID NO* 72) 

RAT JUNAP A-C--T---T « ft " VZ' iff 

Chicken (SE0 10 N0: 73 > 

QuaU CHKJUN ' C G CCC (SE0 10 H0: 74 > 

OrlLZtT GGC AG TG- ( S6 0 .0 NO: 75) 

C: DR0JUN G— -T--AT (SEQ ID NO: 76) 

Homologous sequences in GenBank 

m?7^ c i9Ure I NUClC0 - G * G < SEQ '0 NO: 77) 

2' F,9ure 2 - N ^ le °- A cc (SEO ID NO: 78) 

hmmpmpd? 9ree " C ° ne Ph ° t0 - T " AA < SEQ 10 79) 

HUMPIGMF2 colour-bUnd pho. T-AA (SEO ID NO: 81) 
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Tabte 22. c-jun specific probe (C2H7). 
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Subtype GenBanlc name 



Human 




8: 


HUMJUNCA 


C: 


HUMJUNA 


0: 


HUMJUNDR 


Mouse 




B : 


MUSJUNBA 


C: 


MUSJUN 




MUSCJUN 




MUSJUNC 


D: 


MUSJUNO 




MUSJUNDR 




MUSJUNDA 


Rat 




C: 


RATJUNAP 




RATRJG9 


Ch icken 




C: 


CHKJUN 


Qua i I 




C: 


QULJUN 


Drosophi la 


C: 


DROJUN 



C2H7.DNA 
CCACGGCCAACATGCTC 



-G- 



-A---C---CAGC 

•--G-G-C G 

C CAGC 



CCTACA- 



Homotogous sequences in GenBank 

MXBPALPA L.enzymogenes 

MX8PALP L.enzymogenes 



-G-C- 
•G-C- 
-G-C- 



-G 
* G 
*G 



-CG- 
•CG* 



(SEQ ID NO: 82) 
CSEQ 10 NO: 83) 
(SEQ ID NO: 84) 

(SEQ 10 NO: 85) 
(SEQ 10 NO: 87) 
(SEQ 10 NO: 88) 
(SEQ ID NO: 89) 
(SEQ ID NO: 90) 
(SEQ ID NO: 91) 
(SEQ ID NO: 92) 



(SEQ ID NO: 
(SEQ ID NO: 



93) 
94) 



(SEQ ID NO: 95) 

(SEQ 10 NO: 96) 

-ACC (SEQ 10 NO: 97) 



(SEQ ID NO: 
(SEQ 10 NO: 



98) 
99) 



35 
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Table 23. Human jun-D specific probe ( HUMD965 ) 



Subtype GenBank name 



HUM096S 
ACACGCAGGAGCGCATC 



Human 




B: 


HUMJUNCA 


C: 


HUM J UNA 


D: 


HUMJUMOR 


Mouse 




B: 


MUSJUN8A 


C: 


MUSJUNC 




MUSCJUN 




MUSJUN 


D: 


MUSJUND 




MUSJUNDA 




MUSJUNDR 


Rat 




C: 


RAT JUNAP 




RATRJG9 


Chicken 




C: 


CHKJUN 


Qua i t 




C: 


QULJUN 


Drosoph t la 


C: 


DROJUN 



GG-A--T- 
•GT-C--- 



-GT- 



•CT- 



G--A- 



Homologous sequences in GenBank 



CELPOUI 
ECOPRIAY 
SIN0CK82 
TRPPROC 
ECOCYSJIHA 



C.elegans RNA 

E-colf primo. -ACA- 

Ockelbo B2 g. C 

Treponema pa. G- — 
E.coli NADPH 



— G — 



-AGAC 

-GT-T G— - 

-GT - T G 

-GT-T --G--- 



-GT-T--- G--- 

-GT-T G--- 



-A-A- 



-A-A- 



(SEO 10 NO: 100) 
(SEQ ID NO: 101) 
(SEQ ID NO: 102) 



(SEQ ID 
(SEO 10 
(SEO 10 
(SEQ ID 
(SEQ 10 
(SEQ ID 
(SEQ ID 



NO: 103) 
NO: 104) 
NO: 105) 
NO: 106) 
NO: 107) 
NO: 108) 
NO: 109) 



(SEQ ID NO: 
(SEQ ID NO: 



110) 
111) 



(SEQ ID NO: 112) 
(SEQ ID NO: 113) 
(SEQ ID NO: 114) 



A-AA 


(SEQ 


ID 


NO: 


115) 




(SEQ 


ID 


NO: 


116) 




(SEQ 


ID 


NO: 


117) 




(SEO 


ID 


NO: 


118) 




(SEQ 


ID 


NO: 


119) 
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Table 24. Mouse jun-0 specific probe (MUS01063). 



Subtype GenBank name 



HUSO 1063. DMA 
CCCTCAAAAGCCAGAACACCC 



Human 



8: 


HUMJUNCA 


C: 


HUM J UNA 


0: 


HUMJUNDR 


Mouse 




B: 


MUSJUNBA 


C: 


MUSJUNC 




MUSJUW 




MUSCJUN 


D: 


MUSJUN0A 




MUSJUNDR 




MUSJUND 


Rat 




C: 


RATRJG9 




RAT JUNAP 


Chicken 




C: 


CHKJUN 


Qua i I 




C: 


QULJUN 


Drosoph i la 


C: 


DROJUN 



-G GGC-G — 

AA- -GC-C ■ 



--G-G- (SEQ ID NO: 120) 

GC CSEQ 10 NO: 121) 

G- (SEQ 10 NO: 122) 



TA TCC TCTG C 

AA - - GC* T GC 

AA--GC-T - GC 

AA--GC-T GC 



(SEQ 10 NO: 
(SEQ 10 NO: 
(SEQ 10 NO: 
(SEQ 10 NO: 



(SEQ 
(SEQ 
(SEQ 



ID NO: 
10 NO: 
10 NO: 



123) 
124) 
125) 
126) 
127) 
128) 
129) 



G GCC GC-C TT (SEQ ID NO: 130) 

G GCC GC-C TT (SEQ ID NO: 131) 

A---G-GG-A-A G--- ( SEQ ID NO: 132) 

—-C--G-T TG--G-A (SEQ ID NO: 133) 

A GGA TGTGGCGC (SEQ ID NO: 134) 



Homologous sequences in GenBank 

M27221 TG T--T (SEQ ID NO: 

OROAMY D.erecta G G-A--T (SEQ ID NO* 

DROAMYQ D.erecta G G-A--T (SEQ 10 NO' 

HUMTGCMUOE Immunoglob. A--C GTT (SEQ ID NO: 



135) 
136) 
137) 
138) 
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Idenrifving Probes and PCR Primers 

Probes and PCR primers for use in the methods of the present invention can be 
identified in any way known to the art. Preferably, however, such probes and primers are 
identified by a computer. We have developed a novel computer system for identifying the 
5 sequences to be used in such probes and primers. This system is an automated system 

which allows the user to calculate and design extremely accurate oligonucleotide probes 
and PCR primers. 

The software of the present invention runs under Microsoft Windows® on IBM® 
compatible personal computers (PC's). This invention allows a researcher to design 
oligonucleotide probes based on the GenBank database of DNA and mRNA sequences. 
The present invention further allows examination of probes for specificity or commonality 
with respect to a user-selected target gene sequences. Hybridation strength between a 
probe and a target subsequence of DNA or mRNA can be estimated through a 
hybridization strength model. Quantitatively, hybridization strength is given as the melting 
15 temperature (Tm). 

Two models for estimating hybridization strength models are supported by this 
invention: 1) the Mismatch Model and 2) the H-Site Model. In either case, the user can 
select the following calculations for each probe, results of which are then made available 
for display and analysis. I) Sequence, Melting Temperature (Tm) and Hairpin 
characteristics (a hairpin is a nucleoside sequence that is homologous to itself and can 
"fold back" with one portion of the probe hybridizing to another portion of the same 
probe); 2) Hybridization to other species within the preparation mixture; and (3) Location 
and Tm for the strongest hybridizations. The results of the invention s calculations are 
then displayed on a Mitsuhashi Probe Selection Diagram (MPSD) which is a graphic- 
display of all potential hybridizations between the target mRNA and the probe sequences 
in the preparation. 

The Main OligoProbe Design Station dialog window controls all user-definable 
settings in the program. The user is offered a number ot options at this window. The File 
option allows the user to print, print in color, save selected probes, and exit the program. 
The Preparation option allows the user to open and create preparation (PRP) files. The 
Models option allows the user to chose between the two hybridization models currently 
supported by the OligoProbe DesicnStation: 1) the H-Site Model and 2) the Mismatch 
Model. 
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If the user selects the H-Site Model option, the melting temperature tor each probe 
and the nucleation threshold parameters can be set. The nucleation threshold is the 
number of base pairs constituting a nucleation site (a subsequence with an exact match). 
If the user selects the Mismatch Model option, the probe length and mismatches (N) can 
5 be set. 

Mismatch Model 

The Mismatch Model is used for designing DNA and mRNA probes utilizing 
sequence database information from sources such as GenBank. In this Model, 
hybridization strength is related only to the number of base pair mismatches between a 
10 probe and its target. Generally, the more mismatches a user allows when setting 

parameters, the more probes will be identified. The Mismatch Model does not take into 
account the GC content of candidate probes so there is no calculation of the probe's 
binding strength. 

The basic technologies employed by the Mismatch model are hashing and continuous 

15 seed filtration. Hashing involves the application of an algorithm to the records in a set of 

data to obtain a symmetric grouping of the records. When using an indexed set of data 
such as a database, hashing is the process of transforming a record key to an* index value 
for storing and retrieving a record. The Mismatch Model is essentially a quick process for 
determining exact and inexact matching between DNA and mRNA sequences to support 

20 the Mitsuhashi Probe Selection Diagram (MPSD). 

The algorithm used by the Mismatch Model is based on the Waterman-Pevzner 
Algorithm (WPALG). which is a computer-based probe selection process. Essentially, this 
is a combination of new and improved pattern matching processes. See Hume and Sunday 
(1991, Ref. 4), Landau et a I (1986-1990, Refs. 6, 7, 8). Grossi and Luccio (1989. Ref. 3), 

25 and Ukkonen (1982, Ref. 14). 

There are three principal programs that make up the Mismatch Model in this 
implementation of the invention. The first is designated by the inventors as "k diff." 
WPALG uses k_diff to find all locations of matches of length greater than or equal to one 
(1) (length is user-specified) with less than or equal to k number of mismatches (k is also 

30 user-specified) between the two sequences. If a candidate oligonucleotide probe tails to 

match that well, it is considered unique. k_diff uses hashing and continuous seed filtration, 
and looks for homologs by searching GenBank and other databases with similar tile 
formats. The technique of continuous seed filtration allows tor much more efficient 
searching than previously implemented techniques. 
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A seed is defined in this invention to be a subsequence having a length equal to the 
longest exact match in the worst case scenario. For example, suppose the user selects a 
probe length (1) of 18, with 2 or fewer mismatches (k). If a match exists with 2 
mismatches, then there must be a perfectly matching subsequence of length equal to 6. 
5 Once the seed length has been determined, the Mismatch Model looks at all substrings of 

that seed length (in this example, the seed length would be 6). finds the perfectly matched 
base pair subsequence of length equals 6. and then looks to see if this subsequence extends 
to a sequence of length equal to the user selected probe length (i.e., 18 in this example). 
If so, a candidate probe has been found that meets the user's criteria. 
10 Where the seed size is large (i.e.. a long string of unique nucleotides), the program 

allocates a relatively large amount of memory for the hash table. This invention has an 
option that allows memory allocation for GenBank entries just once at the beginning of 
the program, instead of reallocating memory for each GenBank entry. This reduces input 
time for GenBank entries by as much as a factor of two (2), but this method requires the 
15 user to know the maximum GenBank entry size in advance. 

A probe is found to hybridize if it has k or fewer mismatches with a target sequence 
from the database or file searched. The hit extension time for all appropriate parameters 
of the Mismatch Model has been found by experimentation to be less than thirty-five (35) 
seconds, except in one case where the minimum probe length (I) was set to 24 and the 
20 maximum number of mismatches (k) was set to four (4). This situation would rarely be 

used in real gene localization experiments because the hybridization conditions are too 
weak. 

H-Site Model 

In this embodiment of the invention, the second hybridization strength model is 
25 termed the H-Site Model. One aspect of the H-Site Model uses a generalization of an 

experimental formula to analize nucleotide binding strength. The basic formula on which 
this aspect of the model is built is as follows: 

Tm = 81.5 - l6.6(log[Na]) - .63 %(formamide) + .41 (<~c(G + Q) - 600 / N 
In this formula, log[Na] is the log of the sodium concentration. %(G + C) is the fraction 
of matched base pairs which are G-C complementary, and N is the probe length. This 
formula relates the fact that melting temperature is a function of both probe length and 
percent GC content. This basic formula has been modified in this invention to account 
for the presence of mismatches. Each percent of mismatch reduces the melting 
temperature by an average of 1 .25° (2°C for an AT mismatch, and 4°C for a GC 
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mismatch). This formula is. however, an approximation. The actual melting temperature 
might potentially differ from this approximation, especially for short probes or probes with 
a relatively large number of mismatches. 

Hybridization strength in the H-Site Model is related to each of the following factors: 
5 1) "binding region"; 2) type of mismatch (GC or AT substitution); 3) length of the probe; 

4) GC content of the binding region; and 5) existence of a "nueleation site" (a subsequence 
with an exact match). The type of mismatch and GC content of the binding region from 
each sequence contributes to a candidate probe's binding strength. The binding strength 
from each probe is thereby determined enabling the user to select an optimal probe. 
10 The fundamental assumption of the H-Site Model is that binding strength is mostly 

determined by ;i paired subsequence of the probe and target, called the binding region. 
If the subsequence binding region contains more GC pairs than AT pairs, the binding 
strength will be higher due to the greater number of hydrogen bonds between G and C 
bases (three bonds) in comparison to A and T bases (two bonds). Thus. GC rich probes 
15 have a higher melting temperature and subsequently form stronger hybridizations. 

In the H-Site Model the program determines optimal probes, ideaLly without any 
mismatches to the target gene. With this model, however, a candidate probe can have 
more AT mismatches if the sequence is GC rich. The amount of allowable AT mismatches 
in a specific sequence is determined in the present invention program by looking primarily 
20 at subsequence regions of the probe and target that match without penalizing the probe 

for areas that mismatch. If the mismatches are located at either or both of the ends of 
the binding region, there is little effect on the overall stability of the base-pairing. 
Centrally located mismatches in the binding region are much more deleterious, as this will 
significantly lower the binding strength of the probe. 
25 The formula cited above for the melting temperature applies within the binding 

region. The length of the probe is used to calculate percentages, but all other parameters 
of the formula are applied to the binding region only. The H-Site Model further assumes 
the existence of a nueleation site. The length of this nueleation site may be set by the 
user. Typically, a value of S to 10 base pairs is used. To complete the H-Site Model, the 
30 binding region is chosen so as to maximize the melting temperature Tm among all regions 

containing a nueleation site, assuming one exists (otherwise, Tm = 0). 

The H-Site Model is more complex than the Mismatch Model discussed a hove in that 
hybridization strength is modeled as a sum of multiple subsequence contributions, with 
matches generally providing positive binding energy and mismatches generally providing 
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negative binding energy. The exnct binding energies to be used depend only on the 
matched or mismatched pair. These coefficients may be specified by the user, although 
in the current version of this invention these coefficients are not explicitly user-selectable, 
but rather are selected to best fit the hybridization strength formulas developed by Itakura 
5 et al (1984, Ref. 5), Bolton and McCarthy (1962, Ref. 2), Benner em! ( 1973, Ref. 1), and 

Southern (1975, Ref. 13). 

A unique aspect of the H-Site Model is that hybridization strength is determined by 
the optimal binding region between the candidate probe and binding locus. This binding 
region is called the hybridization site, or h-site, and is selected so as to maximize overall 
10 hybridization strength, so that mismatches outside the binding region do not detract from 

the estimated hybridization strength. Several other unique features of the H-Site Model 
include the fact that it is more oriented toward RNA and especially cDNA sequences than 
DNA sequences, and the fact that the user has control over preparation and environmental 
variables. 

15 The emphasis on RNA and cDNA sequences allows the user to concentrate on coding 

regions of genes, rather than necessitating sorting through all of a genomic sequence for 
the desired probe. The enhanced user control over environmental and preparation 
variables allows the user to more accurately simulate laboratory conditions that closely 
correspond with any experiments he or she is conducting. Further, this implementation 

20 of the invention does some preliminary preprocessing of the GenBank database to sort out 

and select the cDNA sequences. This is done by locating a keyword (in this case CDS) 
in each GenBank record, thereby eliminating any sequences containing introns. 

The Mitsuhashi Probe Selection Diagram (MPSD), Figures 28A and 28B. is a key 
feature of this invention, as it is a unique way of visualizing the results of the probe 

25 designed by the Mismatch and H-Site Models. It is a graphic display of all of the 

hybridizations of candidate oligonucleotide probes and the target with all sequences in the 
preparation. Given a gene sequence database and a target mRNA sequence, the MPSD 
graphically displays all of the candidate probes and their hybridization strengths with all 
sequences from the database. In the present implementation, each melting temperature 

30 is displayed as a different color, from red (highest Tm) to blue (lowest Tm). The MPSD 

allows the user to see visually the number of false hybridizations at various temperatures 
for all candidate probes, and the sources of these false hybridizations (with a loci and 
sequence comparison). A locus may be a specitic site or place, or. in the genetic sense. 
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a locus is any of the homologous parts of a pair of chromosomes that may be occupied by 
allelic genes. 

These probes may then be used to test for the presence of precursors of specific 
proteins in living tissues. The oligonucleotide probes designed with this invention may be 
5 used for medical diagnostic kits. DNA identification, and potentially continuous monitoring 

of metabolic processes in human beings. The present implementation of this computerized 
design tool runs under Microsoft* Windows™ v. 3.1 (made by Microsoft Corporation; 
Redmond, Washington) on IBM* compatible personal computers (PC's). 

The H-Site Model of this invention is unique in that it offers a multitude of 
information on selected probes and original and distinctive means of visualizing, analyzing 
and selecting among candidate probes designed with the invention. Candidate probes are 
analyzed using the H-Site Model for their binding specificity relative to some known set 
of m RNA or DNA sequences, collected in a database such as the Gen Bank database. The 
first step involves selection of candidate probes at some or all the positions along a given 
target. Next, a melting temperature model is selected, and an accounting is made of how 
many false hybridizations each candidate probe will produce and what the melting 
temperature of each will be. Lastly, the results are presented to the researcher along with 
a unique set of tools for visualizing, analyzing and selecting among the candidate probes. 

This invention is both much faster and much more accurate than the methods that 
are currently in use. It is unique because it is the only method that can find not only the 
most specific and unique sequence, but also the common sequences. Further, it allows the 
user to perform many types of analysis on the candidate probes, in addition to comparing 
those probes in various ways to the target sequences and to each other. 

Therefore, it is the object of this invention to provide a practical and user-friendly 
system that allows a researcher to design both specific and common oligonucleotide 
probes, and to do this in less time and with much more accuracy than currently done. 

This invention is employed in the form best seen in Figure 20. There, the 
combination of this invention consists of an IBM* compatible personal computer (PC), 
running software specific to this invention, and having access to a distributed database with 
the file formats found in the GenBank database and other related databases. 

Unless defined otherwise, all technical and scientific terms used herein have the same 
meaning as commonly understood by one of ordinary skill in the art to which this invention 
belongs. Although any methods and materials similar or equivalent to those described 
herein can be used in the practice or testing of the present invention, the preferred 
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methods and materials are now described. All publications mentioned hereunder are 
incorporated herein by reference. 

The preferred computer hardware capable of operating this invention involves of a 
system with at least the following specifications (Figure 20): 1) an IBM® compatible PC, 
5 generally designated 1A. IB. and 1C, with an 80486 coprocessor, running at 33 Mhz or 

faster; 2) 8 or more MB of RAM, 1A; 3) a hard disk IB with at least 200 MB of storage 
space, but preferably 1 GB; 4) a VGA color monitor (1C) with graphics capabilities of a 
size sufficient to display the invention's output in readable format, preferably with a 
resolution of 1024 x 768; and 5) a 580 MB CD ROM drive 5 (IB of Figure 20 generally 
refers to the internal storage systems included in this PC, clockwise from upper right, two 
floppy drives, and a hard disk). Because the software of this invention preferably has a 
Microsoft® Windows"* interface, the user will also need a mouse 2, or some other type of 
pointing device. 

The preferred embodiment of this invention would also include a laser printer 3 
and/or a color plotter 4. The invention may also require a modem (which can be internal 
or external) if the user does not have access to the CD ROM versions of the GenBank 
database 8 (containing a variable number of gene sequences 6). If a modem is used, 
information and instructions are transmitted via telephone lines to and from the GenBank 
database 8. If a CD ROM drive 5 is used, the GenBank database (or specific portions of 
20 it) is stored on a number of CDs. 

The computer system should preferably have at least the Microsoft® DOS version 5.0 
operating system running Microsoft® Windows™ version 3.1. All of the programs in the 
preferred embodiment of the invention were written in the Borland® C+ + (Borland 
International. Inc.; Scotts Valley. CA) computer language. It should be noted that 
subsequently developed computers, storage systems, and languages may be adapted to 
utilize this invention and vice versa. 

This inventive computer program is designed to enable the user to access DNA. 
mRNA and cDNA sequences stored either in the GenBank or in databases with similar 
file formats. GenBank is a distributed flat file database made up of records, each record 
containing a variable number of fields in ASCII file format. The stored database itself is 
distributed, and there is no one database management system (DBMS) common to even 
a majority of its users. One general format, called the line type format, is used both for 
the distributed database and lor all of GenBanks internal record keeping. All data and 
system files and indexes for GenBank are kept in text files in this line type format. 
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The primary GenBank database is currently distributed in a multitude of files or 
divisions, each of which represents the genome of a particular species (or at least as much 
of it as is currently known and sequenced and publicly available). The GenBank provides 
a collection of nucleotide sequences as well as relevant bibliographic and biological 
5 annotation. Release 72.0 (6/92) of the GenBank CD distribution contains over 71,000 loci 

with a total of over ninety-two (92) million nucleotides. GenBank is distributed by 
IntelliGenetics, of Mountain View, CA, in cooperation with the National Center for 
Biotechnology Information, National Library of Medecinge, in Bethesda, MD. 

10 1. Overall Description of the OlicoProbe DesignStation 

a. Genera! Theory 

The intent of this invention is to provide one or more fast processes for performing 
exact and inexact matching between DNA sequences to support the Mitsuhashi Probe 
Selection Diagram (MPSD) discussed below, and other analysis with interactive graphical 

15 analysis tools. Hybridization strength between a candidate oligonucleotide probe and a 

target subsequence of DNA, mRNA or cDNA can be estimated through a hybridization 
strength model. Quantitatively, hybridization strength is given as the melting temperature 
(Tm). Currently, two hybridization strength models are supported by the invention: 1) 
the Mismatch Model and 2) the H-Site Model. 

20 b. Inputs 

i. Main OligoProhe DesignStation Dialog Window 

The Main Oligo Probe DesignStation dialog window, Figure 21, controls all user- 
definable settings. This window has a menu bar offering five options: 1) File 10; 2) 
Preparation 80; 3) Models 70; 4) Experiment 90; and 5) Help 50. The File 10 option 

25 allows the user to print, print in color, save selected probes, and exit the program. The 

Preparation 80 option allows the user to open and create preparation (PRP) files. 

The Models 70 option allows the user to chose between the two hybridization models 
currently supported by the OligoProbe DesignStation: 1) the H-Site Model 71 and 2) the 
Mismatch Model 75. If the user selects the H-Site Model 71 option, the left hand menu 

30 of Figure 21C is displayed and the user sets the following model parameters: 1) the 

melting temperature Tm 72 tor which probes are being designed (i.e., the melting 
temperature that corresponds to a particular experiment or condition the user desires to 
simulate); and 2) the nuclealion threshold 73. which is the number of base pairs 
constituting a nucleation site. If the user selects the Mismatch Model 75 option, the right 
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hand menu of Figure 21C is displayed and the user sets the following model parameters: 
1) probe length 76, which is the number of base pairs in probes to be considered; and 2) 
mismatch N 77, which is the maximum number of mismatches constituting a hybridization. 
Computation of the user's request takes longer with the H-Site Model if the threshold 73 
5 setting is decreased, but longer with the Mismatch Model if the number of mismatches K 

77 is increased. 

In addition, for both Model options the user chooses the target species 11 DNA or 
mRNA for which probes are being designed and the preparation 12, a file of all sequences 
with which hybridizations are to be calculated. A sample of a target species file is shown 
10 in Figure 47 (humbjunx.cds), while a sample of a preparation file is shown in Figures 48A- 

48C (junmix.seq). Each of these inputs is represented by a- file name and extension in 
standard DOS format. In the target species and preparation fields, the file format follows 
the GenBank format with each of the fields having a default file extension. Pressing the 
"OK" button 91 (Figure 2 1C) will initiate processing, while pressing the "Cancel" button 93 
15 will stop the processing. 

The Experiment 90 option and the Help 50 option are expansion options not yet 
available in the current implementation of the invention, 
c Processing 

Figures 22A and 22B are flow charts of the overall OligoProbe DesignStation 
Program, illustrating its sequence and structure. Generally, the main or "control" program 
of the OligoProbe DesignStation performs overall maintenance and control functions. This 
program, as illustrated in Figures 22A and 22B. accomplishes the general housekeeping 
functions 51. such as defining global variables. The user-friendly interface 53, carries out 
the user-input procedures 55. the file 57 or database 59 access procedures, calling of the 
25 model program 62 or 63 selected by the user, and the user-selected report 65 or display 

67, 69, 71 and 73 features. Each of these features is discussed in more detail in later 
sections, with the exception of the input procedures, which involves capturing the user's 
set-up and control inputs, 
d. Outputs 

30 The Mitsuhashi Probe Selection Diagram Window 

The Mitsuhashi Probe Selection Diagram (MPSD), Figure 23. is a key feature of the 
invention as it is a unique way of visualizing the results of the program's calculations. It 
is a graphic display ot all of the hybridizations of probes with the target oligonucleotides 
in the preparation. Specifically, given a nucleotide sequence database and a target mRNA, 
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(57) Abstract 



An oligonucleotide including at least one concatenation coding for a polypeptide of formula (P-K)n, wherein n is an integer of at 
least 2, P is a proline amino acid residue, K is a lysine amino acid residue, and the sign is a bond, particularly a peptide bond, between 
the two amino acid residues. The n (P-K) units are also bound together by such bonds, e.g. peptide bonds. 

(57) Abrege* 

L'invention a pour objet un oligonucleotide comprenant au moins un enchainement codant pour un polypeptide repondant a la formule 
(P-K)n, dans laquelle: n est un entier supetieur ou 6gal a 2, P represente un residu d'acide amin6 Proline, K represente un residu d'acide 
amine* Lysine, le signe "-" symbolise une liaison entre les deux residus d'acides amines et notamment une liaison de type peptidique, les n 
unites (P-K) etant liees entre elles egalement par de telles liaisons par exemple de type peptidique. 
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Proteines de res rve de plant s enrichies en acides amines, 
notamment y-zeine de mats nrichi en lysine Plantes exprimant ces 
proteines 

5 La presente demande a pour objet de nouveaux moyens permettant 

la preparation de plantes exprimant des proteines de reserve enrichies en 
acides amines deficitaires dans les proteines de reserve normales, et en 
particulier de proteines de reserves enrichies en lysine, ^invention a 
egalement pour objet les proteines de reserve ainsi modifiees, ainsi que 

10 des plantes exprimant ces proteines de reserve modifiees. 

De nombreuses plantes, le cas echeant apres transformation par 
des etapes physico-cnimiques, sont d'une importance economique majeure 
dans ('alimentation humaine ou animale, et le probleme de ('amelioration 
de leur qualite nutritionnelle a deja donne lieu a differents types de 

15 recherches. En particulier, pour remedier a Pinsuffisante presence de 
certains acides amines dans les proteines de reserve des plantes, on a mis 
au point des varietes selectionnees presentant des qualites nutritionnelles 
superieures, ou propose differentes modifications faisant appe! a 
Tutilisation des techniques du genie genetique afin de favoriser ou 

20 d'augmenter la production dans ces plantes notamment de certains acides 
amines deficitaires, et neanmoins importants pour les qualites 
nutritionnelles de la plante. Ces acides amines deficitaires sont par 
exemple la lysine, la methionine. 

Dans le cadre de la presente demande, les inventeurs se sont 

25 propose d'apporter une solution originale au probleme de Tamelioration 
des plantes, notamment de Tamelioration de leurs qualites nutritionnelles 
et ont a cet effet travaille en premier lieu a partir d'une plante d'importance 
economique considerable, le mai's. Plus precisement, ils se sont interesses 
aux proteines de reserve de I'endosperme d s semences de maTs, 
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proteines qui comprennent notamment les zeines et, en particulier, la y- 
zeine. 

Au cours du developpement des semences de mats, les cellules de 
Tendosperme synthetisent de grandes quantites de proteines de reserve, 
5 et notamment de a-, fh et y-zeines. Ces zeines sont accumulees dans les 
corps proteiques derives du reticulum endoplasmique (ER). 

D'une fa?on generate, les zeines represented un groupe complexe 
de proteines, reparties en plusieurs groupes, a-, y- et 5-zeines (Larkins 
et a!, 1989), codees par une famille multigenique (Hagen et Rubenstein, 

10 1980, Gene 13, 239-249). Bien qu'elles presentent une structure variable, 
ces proteines ont des caracteristiques communes : la presence dans leur 
structure primaire de repetitions en tandem riches en residus d'acides 
amines de type Proline, la presence de nombreux residus hydrophobes qui 
conferent leur insolubility a ces proteines dans un milieu aqueux, et 

15 I'absence de residus lysine, acides amines essentiels pour I'homme et les 
animaux monogastriques. Uabsence de lysine dans toutes les proteines 
majoritaires (detectees en grande quantite dans Tendosperme) 
naturellement produites de la famille des zeines conduit a une composition 
en acides amines desequilibree dans les semences de mai's. 

20 Parmi ces proteines, la y-zeine de mai's est une proteine ayant un 

poids moleculaire de 28 kDa, dont la sequence codante a ete decrite sous 
la forme de I'ADNc par Prat et al (Nucleic Acids Research, vol. 13, n° 5, 
1985, p. 1494-1504). La sequence complete du gene codant pour la y- 
zeine, y compris les sequences non codantes en amont et en aval 

25 contenant les elements de regulation de ('expression, a ete decrite par 
Reina, M. et al (Nucleic Acids Research, vol. 18, n° 21 , 1990, p 6426). 

Jusqu'a present, differentes approches ont 6te envisagees pour 
augmenter la teneur en lysine des proteines de la famille des zeines. A cet 
egard, des approches genetiques et moleculaires ont ete mises en oeuvre. 
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Par exemple, des mutants permettant I'obtention de mais riche en lysine, 
tels que le mutant opaque-2 (o2) et le mutant floury-2 (fl-2) (Mertz t al, 
1964, Science 145, 279-280, Nelson et al, 1965, Science 150, 1469-1470) 
ont ainsi ete proposes et des tentatives pour remedier aux effets nefastes 
s de ('absence de certaines classes de zeines, notamment les a-zeines, sur 
les caracteristiques phenotypiques, par selection de mais contenant des 
genes modificateurs o2 (Paez et al, 1969, Plant Sci. 9, 251-252, Geetha et 
al, 1991, Plant Cell 3, 1207-1219) ont ete faites. 

Une autre approche a consiste a realiser une action indirecte sur la 

10 production de lysine libre, en particulier dans les plantes dicotyledones. 
Cette technique a ete mise en oeuvre en agissant sur la deregulation des 
enzymes cle (DHTPS et AK) impliquees dans le cycle de la biosynthese de 
la lysine via Taspartate. Un accroissement sensible des niveaux de lysine 
libre a ete obtenu dans les feuilles, mais pas dans les semences, dans des 

is experiences de transformation de plantes de tabac avec des bacteries 
E coli contenant les genes dapA et des bacteries E. coli contenant le gene 
lysC (Shaul et Galili, 1992, Plant J. 2, 203-209 et 1993, Plant Mol. Biol. 23, 
759-768 ; Perl, A., Schaul O., Galili, G., 1992, Plant Molecular Biology, 19, 
p. 81 5-823). Recemment, les memes genes dapA de Corvnebacterium et 

20 lysC de E. coli ont ete utilises et exprimes sous le controle d'un promoteur 
specifique des semences dans des plantes de soja. L'expression de ces 
deux enzymes dans le soja a conduit a une augmentation de cinq fois de la 
teneur en lysine dans les semences (Falco et al, 1995, BIO-Technology 
13, 577-582). 

25 D'autres auteurs (Wallace et al, 1988, Science 240, 662-664) ont 

tente d'augmenter la lysine dans I'a-zeine (19 kDa) des semences de mais 
en incorporant de facon ponctuelle des residus lysine en differentes 
positions dans la molecule d'ct-zeine. L'expression de ces constructions 
dans des oocytes de Xenopus a conduit a I'assemblage correct des zeines 

30 riches en lysine dans les vesicules analogues des corps proteiques. 
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Cependant, I'a-zeine normale et la forme modifiee enrichie en lysine 
etaient degradees lorsqu'elles etaient exprimees dans des semences de 
tabac (Othani et al. 1991, Plant Mol. Biol. 16:117). 

On ne disposait done pas a ce jour de connaissances quant aux 
5 moyens pouvant permettre ['expression d'une zeine enrichie en lysine dans 
les cellules la produisant naturellement dans le mats, e'est-a-dire dans les 
cellules de rendosperme. A fortiori I'expression dans d'autres cellules 
vegetales de zeines enrichies en lysine n'etait pas maltrisee. 

Uun des buts de Pinvention est done de proposer des moyens 
10 permettant d'obtenir une zeine enrichie en lysine, en particulier une y-zeine 
de maYs enrichie en lysine, cette proteine etant exprimee notamment dans 
les cellules des semences de mais et en particulier dans les cellules de 
Tendosperme, ladite proteine modifiee etant en outre exprimee de fa?on 
telle que ses proprietes quant a la localisation et I'accumulation au niveau 
15 du reticulum endoplasmique et des corps proteiques derives soient 
preservees. 

L'expression « enrichie en lysine » signifie, dans le contexte de la 
presente demande, que la proteine comporte un nombre augmente de 
residus lysine par rapport a la proteine naturelle dont elle est derivee, par 
20 exemple par suite d'une modification de la sequence de nucleotides qui 
Texprime. 

^invention propose aussi des moyens pour obtenir I'expression de 
proteines, de preference de y-zeines enrichies en lysine, dans des cellules 
vegetales de differents tissus, par exemples des tissus de feuiUes ou de 
25 racines, et ce le cas echeant dans des cellules vegetales de plantes 
n'exprimant pas naturellement la proteine, en particulier la y-zeine dont la 
production est recherchee. 
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Par ailleurs, selon un mode de realisation particulier de invention, 
d'autres proteines de reserve peuvent etre enrichies en lysine selon des 
modaiites analogues. 

Les inventeurs se sont propose dans un premier temps, pour 
5 realiser I'invention, d'introduire dans le gene codant pour la y-zeine ou 
pour d'autres proteines de reserve du mais ou d'autres plantes, ou dans la 
sequence codante de ce gene, des sequences codant pour des 
polypeptides enrichis en lysine, afm d'obtenir la production de y-zeines ou 
d'autres proteines enrichies en lysine et done, de semences enrichies en 

10 lysine. Differents sites de la sequence codante du gene de la y-zeine ont 
ete identifies comme sites permissifs, (appeles encore "sites neutres") pour 
preparer des sequences de nucleotides ainsi modifiees. 

La presente demande propose done des moyens pour transformer le 
gene codant pour la y-zeine de mais ou pour transformer toute sequence 

is de nucleotides codant pour la y-zeine et derivee de ce gene, de fapon a 
obtenir a partir de I'expression du gene modifie ou plus generalement de la 
sequence de nucleotides modifiee, une proteine enrichie en lysine; ces 
moyens comprennent en particulier des oligonucleotides synthetiques 
codant pour une sequence d'acides amines comportant des residus lysine. 

20 L'invention a egalement pour objet des sequences de nucleotides 

recombinantes ou sequences chimeres susceptibles de coder pour une y- 
zeine enrichie en lysine. 

Entrent egalement dans le cadre de Tinvention des cellules hotes 
transformees par de telles sequences, et en particulier des cellules 

25 vegetales, par exemple des cellules permettant la regeneration de plantes, 
ainsi que des plantes ou des parties de plantes (tissus, organes,...) 
contenant de telles cellules et produisant de fa$on stable des proteines de 
reserve modifiees, et en particulier des y-zeines enrichies en lysine. 
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L'invention a egalement pour objet lesdites proteines modifiees, par 
exemple enrichies en lysine, ainsi que des anticorps diriges contre c s 
proteines. 

Un oligonucleotide approprie pour la realisation de l'invention, 
5 utilisabie pour la preparation de sequences de nucleotides recombinantes, 
est caracterise en ce qu'il comprend au moins un enchainement codant 
pour un polypeptide repondant a la formule (P-K) m dans laquelle : 

- n est un entier superieur ou egal a 2, 

- P represente un residu d'acide amine Proline, 
io - K represente un residu d'acide amine Lysine, 

- le signe "-" symbolise une liaison entre les deux residus d'acides 
amines et notamment une liaison de type peptidique, les n unites (P-K) 
etant liees entre elles egalement par de telles liaisons par exemple de type 
peptidique. 

15 Un oligonucleotide selon Pinvention est done caracterise selon un 

premier mode de realisation, en ce qu'il comprend une sequence codant 
pour une suite de motifs repetes comportant deux acides amines. 

Les codons de ('oligonucleotide peuvent etre identiques pour tous 
les residus Proline et/ou pour tous les residus lysine, lis peuvent aussi etre 
20 differents pour un meme residu d'acide amine, la variation tenant compte 
de la degenerescence du code genetique. 

De preference, cet oligonucleotide est forme par une sequence 
codant pour plus de 2 unites (P-K). De preference n est inferieur ou egal a 
30, en particulier inferieur a 20 et avantageusement n est egal a 4, 5, 6, 7 ( 
25 8, 9 ou10ou15. 

Les "oligonucleotides" selon Tinvention peuvent etre synthetises 
chimiquement par toute technique disponible. 

Le terme "polypeptide" par reference a Tenchainement (P-K)„ 
designe, dans le cadre de la presente demande, une sequence d'acides 
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amines comportant plus de 2 residus d'acides amines et pouvant 
comporter jusqu'a 60 residus d'acides amines. 

Selon une premiere variante de realisation de I'invention, 
('oligonucleotide comprend plusieurs enchainements codant pour un 
5 polypeptide repondant a la formule (P-K) n> identiques ou differents, 
associes en tandem. 

Ces oligonucleotides sont soit des repetitions d'un meme 
enchainement, soit des associations d'enchainement differents. Le nombre 
d'enchainements associes est variable, pouvant par exemple etre compris 
10 entre 2 et 10 enchainements. 

Selon une autre variante de realisation de i'invention, 
I'oligonucleotide precedemment defini est caracterise en ce qu'il comprend 
au moins un enchainement codant pour un polypeptide repondant a la 
formule (P-K) n , dans laquelle la sequence des n unites (P-K) est 
15 interrompue par un ou plusieurs residus decides amines differents des 
residus P ou K. 

De preference, les acides amines supplementaires incorpores dans 
la sequence formee par les unites (P-K) sont choisis pour ne pas modifier 
I'organisation du polypeptide code par Toligonucleotide, ou a tout le moins 
20 pour ne pas provoquer d'interaction avec des acides amines d'une proteine 
dans laquelle ledit polypeptide serait incorpore, dans des conditions qui 
affecteraient la structure et/ou la fonction et/ou la localisation de cette 
proteine. 

Ceci peut en particuiier etre le cas lorsque le nombre d'unites (P-K) 
25 est important ou que plusieurs enchainements formes de sequences 
codant pour des motifs (P-K) n sont associes en tandem et que la 
preparation de ('oligonucleotide correspondant necessite que soient 
synthetisees plusieurs sequences nucletotidiques qui sont ensuite 
associees par exemple au moyen de sequences de liaison (linkers). 
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Selon un autre mode de realisation de ('invention, ^oligonucleotide 
est tel que I'enchainement codant pour le polypeptide comprenant les n 
unites (P-K) est complete a son extremite 5* et/ou a son extremite 3\ par un 
ou plusieurs codons, codant par exemple pour au moins un residu lysine a 
5 I'extremite N-terminale du polypeptide. 

A titre d'exemple, un oligonucleotide prefere selon I'invention est 
caracterise en ce qu'il code pour un polypeptide repondant a la formule (P- 
K), a la formule K-(P-K) 4) ou a la formule 2K (P-K) 4 . 

Selon un mode de realisation particulier, ia composition de cet 
10 oligonucleotide correspond a Tune des sequences decrites dans les pages 
qui suivent et identifies par les designations SEQ ID No:1 et SEQ ID 
No:2. 

Les oligonucleotides precedemment decrits constituent des moyens 
de base pour realiser des sequences de nucleotides recombinantes 
15 capables d'exprimer des proteines ou des polypeptides de reserve de 
plantes enrichis en lysine. 

^invention a done aussi pour objet une sequence de nucleotides 
recombinante comprenant un enchancement de nucleotides codant pour 
une proteine de reserve d'une plante, caracterisee en ce qu'elle comprend 
20 en outre un oligonucleotide selon ['invention, insere en un site de 
I'enchainement de nucleotides choisi de fa?on telle que : 

- Texpression de la sequence de nucleotides dans une cellule 
vegetale determinee permet d'obtenir une proteine de reserve modifiee 
localisee de fa$on identique ou similaire dans cette cellule a la proteine de 

25 reserve normale qui serait exprimee dans la meme cellule, dans ies 
memes conditions par I'enchainement de nucleotides normal codant 
correspondant, et/ou 

- la proteine de reserve modifiee codee par la sequence de 
nucleotides est reconnue immunologiquement par des anticorps produits 

30 contre la proteine de reserve normale correspondante. 
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En particulier, ies anticorps precites sont constitues par un serum 
polyclonal ou sont obtenus contre des epitopes de ia proteine de reserve 
normale qui sont conserves dans la proteine de reserve modifiee. 

Les cellules vegetales auxquelles il est fait reference ci-dessus 
5 comprennent toute cellule vegetale, quelle que soit son origine tissulaire 
ou sa nature. En particulier on s'interesse dans le cadre de I'invention aux 
cellules des organes de reserve, mais aussi aux cellules de feuilles, tiges, 
tubercuies ... 

Par I'expression "proteine de reserve" d'une plante, on entend dans 

10 le cadre de la presente demande une proteine synthetisee durant la 
maturation des semences et qui est utilisee pendant la phase de 
germination comme reserve principale de nutrition. 

De facon generate, il s'agit d'un polypeptide susceptible d'etre 
synthetise dans un tissu de reserve quelle qu'en soit ia localisation dans la 

15 plante ; les proteines de reserve mises en oeuvre dans le cadre de la 
presente demande sont en particulier celles produites dans les graines ou 
les semences des plantes de la famille des cereales, des cruciferes ou des 
legumineuses, et sont par exemple les prolamines, les zeines. 

Le choix du/des site(s) d'insertion de I'oligonucleotide dans 

20 Tenchainement codant pour la proteine de reserve de plante est determine 
par le respect des conditions ci-dessus enoncees. Selon le cas, rinsertion 
peut avoir lieu dans un domaine repetitif (en termes de sequence d'acides 
amines) de la proteine ou au niveau d'une extremite C- ou N-terminale. 

La condition donnee ci-dessus selon laquelle I'expression de la 

25 sequence de nucleotides recombinante de I'invention dans une cellule 
vegetale permet d'obtenir une proteine de reserve modifiee, localisee 
identiquement ou similairement a la proteine de reserve normale qui serait 
exprimee dans les memes conditions dans la meme cellule vegetale, 
comporte par exemple pour les y-zeines synthetisees, ia possibilite d'etre 

30 accumulees dans le reticulum endoplasmique des cellules vegetales qui 
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I'expriment, et en particulier dans les corps proteiques formes a partir de 
ce reticulum endoplasmique, lorsque la proteine est exprimee dans les 
cellules de I'endosperme. 

Pour obtenir ce resultat au moyen des sequences de nucleotides 
5 recombinantes de invention, des systemes d'expression adaptes a I'hote 
cellulaire dans lequel est exprimee la sequence de nucleotides 
selectionnee, et notamment les elements de regulation, par exemple ies 
promoteurs, sont choisis pour leur caractere fonctionnel dans le tissu 
contenant les cellules transformees. Des tests pour effectuer ce choix 
10 peuvent etre realises en s'appuyant sur les differentes constructions 
decrites dans les exemples. 

Pour verifier que les proprietes immunologiques de la proteine de 
reserve modifiee exprimee par la sequence de nucleotides de ('invention 
n'ont pas ete modifiees de fagon consequente, on a utilise par exemple 
15 des antiserums tels que Tantiserum aG2, decrit precisement dans la partie 
experimental qui suit. 

Selon un premier mode de realisation de Pinvention, la sequence de 
nucleotides recombinante est caracterisee en ce qu'elle est obtenue a 
partir d'un enchainement codant de nucleotides qui conduit a I'expression 
20 d'une proteine de reserve naturellement pauvre en lysine. 

De fapon generale, cette sequence de nucleotides recombinante 
code pour une proteine de reserve modifiee derivee d ! une proteine de 
reserve naturellement produite par une plante utilisable en alimentation 
animale ou humaine. 
25 Ainsi, les proteines de reserve dont on modifie la teneur en lysine 

dans le cadre de la presente invention sont avantageusement des 
proteines de reserve de plantes de la famille des cereales, des 
legumineuses ou des cruciferes. Des proteines de reserve particulierement 
importantes sont celles du mai's, en particulier les zeines, et plus 
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specialement la y-zeine du mais, dont on cherche a augmenter la teneur 
en lysine. 

Une sequence de nucleotides recombinante particuliere selon 
Pinvention est caracterisee en ce que I'enchainement codant de 
5 nucleotides codant pour la y-zeine de maTs qu'elle contient, repond a la 
sequence presentee sur la Figure 9. 

D'autres sequences de nucleotides recombinantes de 1'invention 
sont caracterisees en ce que I'enchainement codant de nucleotides 
qu'elles comprennent, code pour une proteine de reserve d'une plante 

10 choisie parmi les suivantes : le soja, le tournesol, le tabac, le ble, I'avoine, 
la luzerne, le riz, le colza, le sorgo, Arabidopsis . 

Selon un mode de realisation pr6fere de r invention, dans la 
sequence de nucleotides recombinante comprenant un enchainement 
codant pour la y-zeine de maTs, 1'oligonucleotide de 1'invention est insere a 

15 la place de renchainement codant pour le domaine Pro-X natureilement 
present dans la sequence d'acides amines de la y-zeine du mais ou a la 
suite de cet enchainement Le domaine Pro-X de la sequence decides 
amines de la y-zeine du mais est constitue par les acides amines situes 
entre les positions 70 et 91 de la sequence decides amines representee a 

20 la Figure 9, correspondant aux nucleotides 265 a 330 de la sequence 
representee a la Figure 9. 

De fagon preferee, dans la sequence de nucleotides de I'invention, 
['oligonucleotide a la place ou a la suite du domaine Pro-X present entre 
les nucleotides 276 et 357 de la sequence representee a la Figure 9. 

25 Selon un autre mode de realisation de invention, dans la sequence 

de nucleotides recombinante comprenant un enchainement codant pour la 
y-zeine de mais, ('oligonucleotide de (Invention est insere a la suite du 
domaine Pro-X conserve dans la sequence de la y-zeine du maTs. 
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Selon une autre variante de realisation, dans la sequence de 
nucleotides recombinante comprenant un enchaTnement codant pour la y- 
zeine de maTs, ('oligonucleotide de I'invention est insere dans te domaine 
Pro-X maintenu dans la sequence de la y-zeine. 
5 Les insertions dont il est question ci-dessus sont realisables par les 

techniques disponibles, par exemple par recombinaison des sequences 
ayant subi une ou plusieurs etapes de digestion enzymatique. 

Dans un mode de realisation particulier de I'invention, une proteine 
de reserve choisie enrichie en un acide amine determine est exprimee 

10 dans des cellules vegetates heterologues. En d'autres termes une proteine 
de reserve presente nature! lement dans une plante donnee, est exprimee 
sous forme enrichie en acides amines, dans une autre plante, ou dans une 
autre cellule que celle qui la produit naturellement. 

Outre I'enchainement codant pour une proteine de reserve de plante 

15 et Toligonucleotide de I'invention, les sequences de nucleotides 
recombinantes de I'invention peuvent egalement comprendre un promoteur 
d'expression et, par exemple, un promoteur choisi pour son caractere 
specifique pour I'expression dans certaines parties ou dans certains tissus 
des plantes, ou au contraire un promoteur choisi pour son caractere 

20 constitutif. A titre d'exemple, lorsqu'ils sont specifiques, les promoteurs 
peuvent etre specifiques des semences et/ou d'organes ou de tissus 
determines de plantes. lis peuvent alternativement ou egalement etre 
specifiques d'une phase de croissance, par exemple d'un stade determine 
de la germination. 

25 A I'inverse 1'utiiisation de promoteurs constitutifs permet Texpression 

constante et generate de la proteine de reserve, entrainant une 
competition entre I'expression de la proteine de reserve native lorsqu'elle 
est presente et ia proteine de reserve modifiee. 

A titre d'exemple, des promoteurs avantageux pour la realisation de 

30 Tinvention sont le promoteur de la y-zeine du mais contenu dans la 
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sequence de 1,7 kb se trouvant en amont de la sequence codant 
representee a la Figure 7, le promoteur du virus de la mosaTque du chou- 
fleur, a savoir le promoteur CaMV35S (EP-B-01 31623), le promoteur 
constitute du gene d'actine 1 de riz (PCT/US 9100073) ou le promoteur 
5 semence specifique "High Molecular weight gluthenine" du ble (Colot V. et 
al, 1987, EMBO Journal, vol. 6, p. 3559-3564). 

Le cas echeant, ces promoteurs sont completes par d'autres 
sequences de regulation, et en particulier des activateurs d'expression. 

A titre d'exemple, d'autres promoteurs utilisables pour la realisation 
10 de I'invention sont par exemple le promoteur du gene codant pour la 
proteine de reserve 2S des semences d' Arabidopsis thaliana . ou encore 
les promoteurs de la lectine de haricot ou de la (3-phaseoline de haricot. 

L'introduction supplemental d'activateurs de Pexpression dans les 
sequences de regulation des sequences de nucleotides selon I'invention 
15 permet egalement d'augmenter le niveau de la transcription primaire de la 
sequence de nucleotides et, le cas echeant. d'augmenter la quantite de 
proteines de reserves modifiees produites. Des activateurs sont par 
exemple des introns de monocotyledones tels que I'intron 1 du gene 
d'actine de riz. 

20 L'invention a egalement pour objet un vecteur de clonage et/ou 

d'expression, caracterise en ce qu'il comprend, en un site non essentiel 
pour sa replication, une sequence de nucleotides repondant a I'une des 
definitions precedemment donnees. Des vecteurs particuliers interessants 
dans le cadre de la realisation de I'invention sont par exemple les 

25 plasmides pP20yZ, pH30yZ ou pH45y2. Le plasmide pP20yZ a ete depose 
a la CNCM (Paris, FRANCE) le 31 Octobre 1995 sous le numero 1-1640. 
Le plasmide pH45yZ a ete depose a la CNCM le 31 Octobre 1995 sous le 
numero 1-1639. 
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Entre egalement dans le cadre de I'invention un polypeptide tel 
qu'exprime par une sequence de nucleotides recombinante repondant aux 
definitions precedentes. 

L'expression "polypeptide" n'introduit pas dans le cadre de la 
5 presente invention de limitation particuiiere quant au nombre d'acides 
amines formant le polypeptide. II peut done s'agir de sequences 
comportant quelques acides amines, habituellement designees par 
Texpression "peptide^', ou encore de sequences beaucoup plus longues 
telles que celles des proteines. 
io L' invention concerne a cet egard la y-zeine de maYs modifiee riche 

en lysine, caracterisee en ce qu'elle est codee par une sequence de 
nucleotides recombinante decrite precedemment. 

Selon un mode de realisation prefere de ttnvention, la y-zeine de 
mais modifiee enrichie en lysine est caracterisee en ce que sa sequence 
15 d'acides amines est modifiee par au moins un polypeptide repondant a la 
formule (P-K) n , dans laquelle : 

- n est un entier superieur ou egal a 2, 

- P represente un residu d'acide amine Proline, 

- K represente un residu d'acide amine Lysine, 

20 - le signe "-" symbolise une liaison entre les deux residus d'acides 

amines et notamment une liaison de type peptidique, les n unites (P-K) 
etant liees par des liaisons notamment de type peptidique. 

Selon une variante de realisation de ['invention le polypeptide 
integre dans la sequence d'acides amines de la y-zeine repond a ia 

25 formule K-(P-K) n . 

Les polypeptides de ('invention repondant a I'une des formules (P- 
K) n , K-(P-K) n ou a des variantes sont substitues a une sequence 
naturellement presente dans la y-zeine de maYs normale ou inseres avec 
deletion d'un ou plusieurs acides amines de la sequence d'acid s amines 
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de la y-zeine de mais normale, ou ajoutes dans la sequence d'acid s 
amines de la y-zeine normale, le site d'insertion du polypeptide etant choisi 
de fa?on telle que : 

- lorsque la y-zeine modifiee riche en lysine est produite dans une 
5 cellule hote, notamment une cellule vegetale, elle est localisee de fapon 

identique ou similaire dans cette cellule a la y-zeine de maTs normale qui 
serait produite dans les memes conditions, dans la meme cellule hote 
et/ou, 

- la y-zeine de maTs modifiee est reconnue par des anticorps diriges 
10 contre la y-zeine normale de maTs. 

Les proteines P20yZ representee a la figure 1 1 ou H30yZ ou H45yZ 
representee a la figure 10, sont des exemples preferes de la realisation de 
Tinvention et representent des y-zeines de mais modifiees enrichies en 
lysine. 

15 L'invention a egalement pour objet une cellule hote recombinante, 

caracterisee en ce qu'elle comprend une sequence de nucleotides decrite 
ci-dessus. 

Des cellules hotes interessantes sont par exemple les cellules de 
bacteries, telles que celles de E. coli ou d' Aqrobacteirum tumefaciens . De 
20 preference, dans le cadre de Hnvention, et pour Texpression stable de la 
proteine de reserve modifiee recherchee, on aura recours a des cellules 
hotes d'origine vegetale. 

A titre d'exemple, ces cellules d'origine vegetale sont des cellules 
de semences, de plante, et par exemple, a titre prefere, des cellules 
25 d'endosperme de semences de mais. 

La sequence de nucleotides de Tinvention est de preference 
introduite dans le genome de la cellule hote de fapon stable et dans des 
conditions telles que la proteine de reserve exprimee enrichie en acides 
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amines et en particulier en lysine est localisee comme le serait la prot'ine 
correspondante normale dans la meme cellule hote. 

Des techniques variees sont disponibles pour transformer les 
cellules hotes. A titre d'exemple, et en vue de transformer de facon stable 
5 ou de facon transitoire les cellules hotes, on utilisera les techniques 
d'electroporation, de bombardement de microprojectiles portant de I'ADN, 
via canon a particules, via culture d'explants avec Aqrobacterium 
tumefaciens . par penetration de microfibres. 

Outre les cellules d'endosperme de semences de mais, des cellules 
10 de soja, de tournesol, de tabac, de ble, d'avoine, de luzerne, de riz, de 
colza, de sorgo ou d' Arabidopsis peuvent etre mises en oeuvre pour 
exprimer les sequences de nucleotides de I'invention. 

La presente demande concerne aussi les semences produisant un 
polypeptide tel que decrit ci-dessus et les plantes produisant ce rheme 
is polypeptide. De preference, ces plantes sont du mais. 

L'invention a aussi pour objet des semences obtenues a partir des 
plantes transformees exprimant le polypeptide de l'invention, en d'autres 
termes le polypeptide de reserve modifie enrichi en acides amines 
determines. 

20 Selon un mode de realisation particulierement interessant de 

I'invention, les proteines y-zeines modifiees enrichies en lysine, sont 
exprimees chez des mutants opaque 2 de mais. Ces mutants p2 decrits 
par Emerson R.A. et al. (1935, Cornell Univ. Agric. Exp. Stn. Mem. 180) et 
caracterises par Mertz E.T. et al (1964, Science 145 : 279-280) voient leur 

25 contenu en lysine fortement augmente ameliorant ainsi fortement les 
qualites nutritives du mais (compensation de son bas niveau en cet acide 
amine essentiel). Les mais classiques ont un contenu en lysine d'environ 
0,24% du produit brut (poids du grain total), les mais opaque 2 avoisinent 
en revanche 0,5% en lysine. Cependant ils presentent des caracteristiques 

30 agronomiques insuffisantes car leur endosperme est beaucoup moins 
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vitreux et se revele tres friable (phenotype n starchy M ). Ceci a pour effet de 
les rendre extremement sensibles aux organismes pathogenes et aux 
etapes de traitements post recolte. Ce phenotype est en fait du a la 
diminution importante de certaines proteines de reserve en particulier les 
5 alpha zeines. En fait Opaque 2 code pour un facteur de transcription 
necessaire a I'expression de certains genes zeines (Schmidt R. J. et al, 
1990, Proc. Natl. Acad. Sci. USA 87, 46-50). 

Des derives opaque 2 n'ayant plus les inconvenients precites ont 
ete developpes par amelioration genetique classique l il s'agit des maTs 

10 QPM (Quality Protein Maize). L'analyse genetique recente de ces maTs 
(Lopes M.A. et al, 1995, Theor. Appl. Genet. 19, 274-281) a montre que 
seulement 2 ou 3 loci se revelent clefs dans ces modifications favorables. 
Des analyses genetiques et biochimiques plus poussees permettent de 
postuler qu'un des 3 loci responsables est le locus y-zeine : les genotypes 

15 de maTs qui portent une duplication de ce gene situe dans la region 
centrometrique du chromosome 7, s'averent tous etre des modificateurs 
d'opaque 2 (Lopes M.A. et al, 1995, Mol. Gen. Genet. 19 : 247 : 603-613). 

La presente invention permet de preparer des mais mutants opaque 
2 a partir de mats n'ayant qu'un gene de y-zeine au niveau du chromosome 

20 7, qui sont complements par Taddition tfune sequence recombinante 
codant pour une y-zeine de maTs enrichie en lysine. Outre le fait d'acquerir 
les proprietes de durete similaire a un mais opaque 2 non mutant, il a 
I'avantage d'augmenter significativement la teneur en lysine, depassant 
ainsi celle des mais QPM. 

25 La presente invention permet d f obtenir des mutants opaque 2 de 

mais modifies, dans lesquels on a insere une sequence de nucleotides 
recombinante codant pour une y-zeine de mais enrichie en lysine. 
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Uinvention a aussi pour objet un precede pour Tobtention de plantes 
ou de semences exprimant une proteine de reserve modifiee, caracterise 
en ce qu'il comprend ies etapes de : 

a) transformation d'une cellule vegetale, avec une sequence de 
5 nucleotides ou un vecteur ci-dessus decrits, dans des conditions 

permettant ('expression de fa?on stable et fonctionnelle de la proteine de 
reserve modifiee codee par la sequence de nucleotides; 

b) regeneration de plantes a partir de la cellule de plante 
transformee de I'etape a), pour obtenir des plantes exprimant la proteine 

10 de reserve modifiee, 

c) le cas echeant, obtention des semences a partir des plantes 
modifiees obtenues a I'etape b). 

Selon un mode de realisation avantageux de I'invention, la plante 
transformee est le mais et la proteine de reserve modifiee enrichie est la y- 
15 zeine, enrichie en lysine. 

Uinvention concerne aussi Ies plantes obtenues par mise en oeuvre 
d'un tel procede. 

Afin d'evaluer la teneur en un acide amine donne des plantes selon 
Tinvention, il est possible d'utiliser un protocole de dosage tel que celui 
20 mentionne dans Zarkadas et al, 1995, J. Agri. Food. Chem. Vol. 43: pages 
84-93. 

D'autres caracteristiques et avantages de Tinvention apparaissent 
dans Ies exemples et dans Ies figures qui suivent. 

25 Figure 1 

- Carte de restriction du plasmide pP20yZ 

Figure 2 

- Carte de restriction du plasmide pH45yZ 
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Figure 3 

Representation schematique des proteines codees par les genes 
modifies et non modifies de la y-zeine : y-zeine de type sauvage (yZ), y- 
5 zeines riches en lysine (P20yZ 1 H30yZ, H45yZ et N13yZ) resultant de 
Tinsertion des oligonucleotides codant pour des sequences riches en 
lysine. La sequence d'acides amines des polypeptides inseres est indiquee 
en utilisant le code a une lettre de la representation des acides amines. 
Les abreviations suivantes sont utilisees : 
10 Term : terminal ; 

ProX DOMAIN : domaine linker proiine-Xaa. 

Figure 4 - Analyse in-vitro des y-zeines riches en lysine. (A) traduction tn- 
vitro et translocation des transcrits correspondant aux y-zeines modifiees 

15 riches en lysine; lignes 1, 5, 9 et 13: produits complets de traduction; 
lignes 2, 6, 10 et 14 : produits complets de traduction apres translocation 
dans les microsomes canins (CM); lignes 3 t 7, 11 et 15: produits de 
translocation resistant a Taction de la proteinase K (PK) ; lignes 4 ( 8, 12 et 
16 : totalite des produits de traduction apres traitement avec la proteinase 

20 K en presence de 0,5% Nonidet P40(NP40). (B) Immunoprecipitation des 
produits de traduction in-vitro correspondant aux proteines y-zeine et y- 
zeine modifiees riches en lysine, en utilisant I'anti-s6rum aPL. Ligne 1 : y- 
zeine ; ligne 2 : P20yZ ; ligne 3 : H30yZ ; ligne 4 : H45yZ et ligne 5 : N13yZ. 
(C) Meme legende que pour (B) mais en utilisant Tantiserum aG2. Les 

25 marqueurs de poids moleculaire (en kilodaltons) sont indiques sur la 
gauche. 

Figure 5 - Activite tissu-specifique du promoteur de la y-zeine. Des 
endospermes de mais, des embryons et des feuilles de mais ont ete co- 
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transformes par bombardement de particules en utilisant les constructions 
representees dans Is figure (dans la partie a droite). Les activites relatives 
de la iuciferase (LUC, colonnes en gris) et de la B-glucuronidase (GUS, 
colonnes hachurees) ont ete exprimees sous la forme d'un multiplicateur 
5 des valeurs obtenues avec des projectiles nus + la deviation standard des 
differents ratios. 

Figure 6 - Expression des y-zeines riches en lysine dans Pendosperme 
sub-aleuronique des cellules. (A) Immunoblot avec un antiserum aPL, de 

10 proteines extraites d'endospermes transformes par pN13yZ (iigne 2), 
pH45yZ (Iigne 3), pH30y2 (Iigne 4) et pP20yZ (Iigne 5). Le temoin (Iigne 1) 
correspond a des endospermes non transformes. Les marqueurs de poids 
moleculaire (en kilodaltons) sont indiques a gauche. (B) expression des 
transcrits H45yZ et N13yZ dans des endospermes transformes de facon 

15 transitoire. Les cDNAs obtenus a partir des tissus transformes avec 
pH45yZ (Iigne 2), pN13yZ (Iigne 3) et ie temoin (Iigne 1) ont ete amplifies 
par PCR et analyses en utilisant un oligonucleotide synthetique codant 
pour une sequence riche en lysine a titre de sonde. 

20 Figure 7 - Accumulation de y-zeines riches en lysine dans les corps 
proteiques de I'endosperme. (A) Analyse en immunoblot en utilisant 
Pantiserum ctPL, des corps proteiques isoles a partir des endospermes 
transformes avec pP20yZ (Iigne 1), pH30yZ (Iigne 2), pH45yZ (Iigne 3), 
pN13yZ (Iigne 4) et aucun ADN (Iigne 5). (B) Analyse en immunoblot en 

25 utilisant Pantiserum aPL, des corps proteiques isoles a partir des 
endospermes transformes avec pP20yZ, pH30yZ et pH45yZ et digeres 
avec la proteinase K en presence d'un tampon isotonique (Sucr., lignes 1, 
3 et 5) ou un tampon hypotonique (H 2 0, lignes 2, 4 et 6). Les marqueurs 
de poids moleculaire (en kilodaltons) sont indiques sur la gauche. 
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Figure 8 - Co-localisation des proteines P20yZ avec les a- et les y-zeines 
dans les corps proteiques de I'endosperme du mais. Une analyse en 
immunocytochimie a ete realisee sur des sections ultrafmes en utilisant 
5 des anticorps aPL (marques avec des particules d'or de diametre 15 nm) 
et des anticorps aZ et aG2 (marques avec des particules d'or de 5 nm). (A) 
Corps proteiques de I'endosperme transforme avec pP20yZ ( 
immunomarques avec Tanticorps aPL (B) Immunolocalisation de P20yZ 
(marque avec des particules d'or de 15 nm) et de I'a-zeine (marquee avec 

10 les particules d'or de 5 nm) dans les corps proteiques isoles a partir des 
endospermes transformes avec pP20yZ. (C) et (D) Immunolocalisation de 
P20yZ (marque avec les particules d'or de 15 nm) et des y-zeines 
(marquees avec les particules d'or de 5 nm) dans des corps proteiques 
isoles a partir des endospermes transformes avec pP20yZ. Les fleches 

15 indiquent les sections tangentielles des corps proteiques. 

Figure 9 - Sequence codante de FADNc la y-zeine de mais et la sequence 
d'acides amines correspondante. 

20 

Figure 10 - Sequence codante de I'ADNc de la H45yZ zeine de mais et la 
sequence d'acides amines correspondante. 

La sequence riche en lysine (28 acides amines) est introduite 
entre les residus d'acides amines 92 et 1 19 de la sequence representee a 
25 la Figure 1 0. 

Figure 11 - Sequence codante de I'ADNc de la P20yZ zeine de mais et la 
sequence d'acides amines correspondante. 
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La sequence riche en lysine (14 acides amines) est introduce 
ntre les residus decides amines 92 et 1 19 de la sequence representee a 
la Figure 11. 

5 Figure 12 - Cartes de restriction des plasmides pBin 19P20yZ et 
pBin 19H30yZ. 

Figure 13 - Endospermes des plants de mais transgeniques accumulant 
de la y-zeine enrichie en lysine 
10 A et B: SDS-page et Immunoblot utilisant I'antiserum aPL 

A) 10 pg de proteine par piste (transformants avec le construit H45yZ) 
piste C: extrait proteique des endospermes d'hybrides B73xA188 (controle) 
piste 1 : A1 

15 piste 2: B1 
piste 3: B2 
piste 4: C1 
piste 5: D1 
piste 6: D2 

20 

B) 1 pg de proteine par piste (transformants avec le construit P20yZ) 
piste C: controle 

piste 1 : A1 
piste 2: A2 
25 piste 3: B1 
piste 4: C1 
piste 5: D1 
piste 6: E1 
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10 



15 



C) SDS-page et coloration a ('argent (3 transformants P20y2 et 3 
transformants H45y2) 
piste 1 : A2 
piste 2: B1 



piste 9: D2 

Figure 14 - Contenu en y-zeine enrichie en lysine par grain (transformants 
45yZ B1 et C1) 

A: coloration a I'argent; 10 ug de proteine par piste 

B: immunoblot utilisant I'antiserum aPL; 1 ug de proteine par piste 

Pistes 1 a 5: extraits proteiques de drfferents endospermes du transformant 

45yZ B1 

Pistes 6 a 10: extraits proteiques de differents endospermes du 
transformant 45yZ C1 . 

Figure 15 : 

A) Contenu en y-zeine enrichie en lysine de 10 grains (transformant 45yZ 
C1 ) utilisant I'antiserum aPL 1 ug de proteine par piste 

Pistes 1 a 10: extrait d'endospermes de 10 descendants 



piste 3: C1 
piste 4: B1 
piste 5: A1 
piste 6: D2 



1 0 ug de proteine par piste 



piste 7: B1 
piste 8: A1 



40 ug de proteine par piste 



20 
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B) Immunoblot des extraits proteiques des endospermes 1 a 5 presents en 
A) et marques avec I'antiserum aG2; 2 ug de proteine par piste. 

EXEMPLES 

5 

A) Preparation de v-zeines modifiees enrichies en lysine et 
expression de ces proteines modifiees suivte de faccumulation dans 
les corps proteiques des cellules de I'endosperme de ma'is 

io La y-zeine est une proteine de reserve du ma'is, ayant un poids 

moleculaire de 28 kD, riche en soufre, qui est accumulee dans les cellules 
de I'endosperme avec les a-et G-zeines, dans les corps proteiques derives 
du reticulum endoplasmique granulaire (ER) (Ludevid et a!., 1984, Plant 
Mol. Biol. 3, 227-234 ; Lending et al., 1984. Plant Cell 1, 1011-1023). La 

15 sequence d'acides amines deduite de la sequence nucleotidique de I'ADNc 
(Prat et al., 1985, Nucl. Acids Res. 13, 1493-1504) et des clones 
genomiques (Boronat et al., 1986, Plant Sci. 47, 95-102) montre que la y- 
zeine n'a pas d'homologie avec les polypeptides de type a-zeine. Bien que 
la y-zeine soit codee par 1 a 2 genes par genome haploide (Boronat et al., 

20 1986, Plant Sci. 47, 95-102) elle represente 10-15% de la totalite des 
proteines des endospermes de ma'is. L'expression du gene de la y-zeine 
dans des systemes heterologues tels que les oocytes de Xenopus (Torrent 
et.al., 1994, Planta 192, 512-518) et dans Arabidopsis thaliana (Geli et al., 
1994, Plant Cell 6, 1911-1922), indique que les polypeptides de y-zeine 

25 restent stables et sont capables en tant que tels de former des corps 
proteiques derives du reticulum endoplasmique, a I'interieur des cellules. 
De plus, des analyses par deletion de differents domaines structuraux de 
la y-zeine ont montre que la sequence N-terminale incluant le domaine 
repete riche en proline, etait responsable de la retention de la y-zeine dans 
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le reticulum endoplasmique et que le domaine C-terminal riche en cyst 'in 
etait responsable de la formation des corps proteiques. Le domaine Pro-X 
ne semblait pas en relation avec la stabiiite de la proteine ni avec sa 
localisation ciblee (Geli et al., 1994, Plant Cell 6, 1911-1922). 
5 Materiel et Methodes 

Materiel vegetal 

Apres une sterilisation en surface (1) des graines de stade 17 DAP 
10 ("days after pollination") du mais de selection W64A ont ete dissequees a 
la main et la couche de pericarpe et d'aleurone a ete separee des 
endospermes. Des sections tangentielles ont ete realisees de facon a 
exposer une grande partie de la surface sub-aleuronique. Si necessaire, 
des embryons ont ete isoles et des feuilles de plantes agees de 7 jours ont 
15 ete dissequees pour retirer le tissu epidermique. Apres dissection, les 
echantillons ont ete deposes dans des boites de Petri sur des papiers 
filtres humidifies avec un milieu MS (Murashige et Skoog, 1 962, Physiol. 
Plant. 15, 473-497). 

20 Constructions plasmidiques 

Un premier ensemble de plasmides, pKSG2, pHbP2, pPbP4 et 
pNaN1 a ete obtenu pour permettre I'introduction de sites de restriction 
dans le gene codant pour la y-zeine. pKSG2 et pHbP2 ont ete construits 
25 selon la description decrite dans la publication de Torrent M. et al. (Planta 
(1994) 192 : 512-518). Le plasmide pKSG2 contient la sequence codante 
de la Y-zeine. 
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Le piasmide pHbP2 est obtenu a partir de pKSG2 et contient une 
sequence codant pour une y-zeine mutee dans laquelle le domaine Pro-X 
de la proteine a ete delete. 

Le piasmide pPbP4 a ete obtenu a la suite de deux etapes de 
5 clonage : (i) le fragment de restriction Sall-Pvull de 350 pb de pKSG2 a ete 
clone dans un piasmide Bluescript (pBSKS, Stratagene, La Jolia, California 
USA) restraint avec Sail et EcoRV (pKSC4) et (ii) le fragment de restriction 
Pvull-Xbal de 600 bp de pKSG2 a ete clone dans les sites de restriction 
Smal-Xbal de pKSC4. La nouvelle construction pPbP4 contient un site de 
10 restriction utile EcoRI juste avant le domaine P-X de la sequence codante 
de la y-zeine. 

Le piasmide pNaN1 a egalement ete obtenu a I'aide de deux etapes 
de clonage : (i) le fragment Nael-Xbal de 250 bp de pKSG2 a ete clone 
dans le piasmide pBSKS restreint avec EcoRV-Xbal (pKSC8) et (ii) le 
15 fragment de restriction Nael-Hindlll de 700 bp (a extremites franches) de 
pKSG2 a ete clone dans le site de restriction Hindlll de pKSC8. La 
nouvelle construction, pNaN1, contient des sites de restriction Clal et 
Hindlll a une position situee a 15 nucleotides avant le codon stop de la y- 
zeine. 

20 Deux oligonucleotides synthetiques repondant aux sequences 

suivantes : SEQ ID N°1 : 

5' C G ATG AATTC AAAC C AAAG C C AAAG C C G AAGC C AAAAG AATTC A3* , et 
sa sequence inverse appelee SEQ ID N°2, dont la sequence est la 
suivante : 

25 S'AGCTTGAATTCTTTTGGCTTCGGC 

codant pour des sequences riches en Jysine designees K(P-K) 4l ont 6te 
hybridees, digerees avec EcoRI et donees dans un site EcoRI de pHbP2 
et pPbP4. Trois clones ont ete selectionnes : pPo2 et pHo3 contenant la 
sequence codante pour K(P-K) 4 et pHo4 comprenant la forme tronquee de 
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la sequence codant pour le y-zeine contenant un tandem 2K(P-K) 4 (sous 
forme d'une sequence K (P-K) 4 EF K(P-K) 4 ) de la sequence codant riche 
en lysine. Les memes oligonucleotides hybrides ont ete digeres avec des 
enzymes Clal-Hindlll et clones dans le plasmide pNaN1 restreint a I'aide 
5 des memes enzymes. Le clone selectionne, pNol, contenait la sequence 
codant pour la sequence riche en lysine K(P-K) 4 a I'extremite N-terminale 
de la y-zeine modifiee correspondante. 

Pour la transformation transitoire de t'endosperme, les sequences 
codant pour la y-zeine modifiee de pPo2 et pHo3 ont ete inserees sous la 
10 forme de fragments Hinclf-Nhel dans des sites Smal-Xbal de pDH51 
(Pietrzah et al., 1986, Nucl. Acid Res. 14, 5857-5868) contenant le 
promoteur 35S du virus de la MosaTque du chou-fleur (CaMV). Le 
promoteur pP20yZ obtenu par I* insertion decrite ci-dessus des fragments 
Hincll-Nhel dans les plasmide pDH51 contient la sequence codante de la 
is y-zeine enrichie en lysine (Figure 8) ainsi que les signaux de la sequence 
35S du virus CaMV pour la formation de I'extremite 3' et la polyadenilation. 
La sequence codante chimere P20yZ a ete construite a partir de la region 
codante de la y-zeine contenu dans le plasmide pKSG2 apres differentes 
etapes de clonage. Le promoteur de 1,7 kb de la y-zeine (Reina et al. 
20 1990, Nucl. Acids Res. 18, 6426) a ete insere dans les extremites franches 
d'un fragment Hindll-Pvul dans pHo4 et pNol restreints avec Xho1 et 
obtenus avec des extremites franches. Les constructions pH45yZ et 
pN13yZ ont respectivement ete obtenues. 

Les nouvelles constructions, respectivement appelees pP20y-Z, 
25 pH30yZ, pH45yZ et pN13yZ ont ete utilisees dans des experiences de 
bombardement biolistique. 

Pour etudier la specificite de differents promoteurs vis-a-vis des 
tissus vegetaux, deux constructions, p1.7yZGUS et pCaMV35SLUC ont ete 
utilise s. p1.7yZGUS a ete obtenue par insertion du promoteur de la y- 
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zeine de 1,7 kb (Hindlll-Pvul) dans un plasmide derive de pPuC18 
contenant le gene GUS et les signaux NOS de polyadenilation en 3' de pBI 
101.1 (Jefferson et al., 1987, Embo. J. 6. 3901-3907). pCaMV35SLUC a 
ete obtenu par insertion du gene codant pour la luciferase (LUC) de 
5 pAHC18 (Bruce et al., 1989, P. H. 86, 9692-9696) dans le polylinker de 
pDH51 (Pietrzak et al., 1986, Nucl. Acids Res. 18, 6426). 

Analyse in-vitro 

10 Les plasmides derives de pBSKS contenant les sequences 

codantes de la y-zeine (pKSG2) et de la y-zeine riche en lysine (pPo2, 
pHo3, pHo4 et pNol) ont ete transcrits in-vitro suivant les protocoles 
standards (Sambrook et al., 1989, Molecular Cloning: A laboratory manual, 
2nd. ed., Cold Spring Harbor Laboratory Ed.. Cold Spring Harbor. New 

15 York). Une traduction in-vitro et une translocation des transcrits 
synthetiques ont ete realisees selon la technique de Torrent et al., (1994, 
Panta 192, 512-518), a I'exception des microsomes canins (CM) utilises 
qui provenaient de Promega (Madison. Wis., USA). L'immunoprecipitation 
des produits traduits a ete effectuee essentiellement selon la methode de 

20 Borgese et Gaetani (1980) en utilisant un serum de lapin anti-y-zeinea-G2, 
(Ludevid et al., 1985, Plant Sci. 41, 41-48) et un antiserum aPL. aPL est 
un antiserum polyclonal de lapin obtenu contre le peptide synthetique 
EFK(P-K) 8 EF. Ce peptide a ete synthetise en utilisant la technique de 
synthese en phase solide decrite par Celma et al., 1992. 

25 

Bombardement de microprojectile 

L'ADN plasmidique a ete absorbe sur des particules d'or (1,0 pm, 
Bio-Rad, Lab., Richmond, CA, USA) selon un protocole decrit par Kikkert 
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(Plant Cell, 33:221-226, 1993). Toutes les cibles ont ete bombardees deux 
fois, en utilisant le dispositif BioRad Biolistic PDS-1000/He. Les cibles ont 
ete positionnees 8 cm derriere un ecran stoppant les macroporteurs, qui 
etaient positionnes 1 cm au dessous du disque de rupture 900 PSI. Apres 
5 le bombardement, les echantillons ont ete incubes pendant 24 heures a 
26°C dans le noir. Des temoins etaient constitues par les cibles 
bombardees avec des microprojectiles depourvus d'ADN. 

Tests enzymatiques 

10 

Les tissus bombardes avec les plasmides p1.7yZGUS et 
pCaMV35SLUC ont ete homogeneises sur de la glace dans un tampon 
contenant 25mM Tris, a pH 7,8, 2mM DTT, 10% de glycerol et 1% de 
Triton X-100. Apres centrifugation a 12.000xg pendant 5 minutes, les 

is surnageants ont ete decantes et la proteine soluble totale dans les extraits 
a ete quantifiee en utilisant le test Bradford (Bio-Rad). L*activite GUS a 6te 
testee par analyse fluorimetrique selon la description de Jefferson (1987) 
en utilisant la 4-methyl-ombelliferyl-S-D-glucuronide (MUG) en tant que 
substrat. Uactivite LUC a ete determinee en utilisant le systeme de test 

20 Luciferase (Luciferase Assay System Kit) diffuse par Promega, selon les 
instructions du fabriquant. 

Extraction des proteines de reserve et analyse sur gel des proteines 

25 Des endospermes transformes avec pP20yZ, phteOyZ, pH45yZ et 

pN13yZ ont ete reduits a Tetat de farine -et les a-zeines ont ete extraites a 
Taide de trois series de solvants contenant 70% d'ethanol. La farine 
residuelle a ete sechee a Pair, et les proteines totales ont ete extraites 
avec un tampon contenant 0 t 25 M Tris-HCi ( pH 6,8, 4% sodium dodecyl 
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sulfate (SDS) et 5% 2-mercaptoethanol, pendant 1 heure a temperatur 
ambiante. Les extraits de proteines ont ete analyses par SDS-PAGE et 
immunoblot selon la description de Ludevid et al., 1985. Des feuilles de 
nitrocellulose ont ete intubees avec de I'antiserum aPL (dilution 1:500) et 
5 de la peroxidase de Raifort conjuguee avec un anticorps secondaire (ECL 
Western Blotting System, Amersham, Buckinghamshire, UK) a ete utilisee 
pour la detection de la proteine. 

Analyse de ['expression de TARN 

10 

L'ARN total a ete extrait selon la description de Logemas et al., 
1987. L'ADN complementaire (ADNc) a ete prepare en utilisant la 
transcriptase reverse et l'o!igo dT de Gibco BRL (Gaithersburg, MD, USA) 
selon les instructions du fabriquant et cet ARN a ete amplifie par une 

is reaction de PGR. Des oligonucleotides amorces utiiisees pour la PCR 
etaient des sequences de 20-mers correspondant aux extremites de 5* et 3' 
du gene de structure de la y-zeine. Des protocoles standards ont ete 
utilises pour la preparation de sondes marquees au 32 P, et pour Panalyse 
sur gel de I'ADN (Sambrook et al. Molecular Cloning: A laboratory manual, 

20 2nd. ed., Cold Spring Harbor Laboratory Ed., Cold Spring Harbor, New 
York) en utilisant un oligonucleotide synthetique codant pour une 
sequence riche en lysine (voir ci-dessus) en tant que sonde. 

Isolement des corps proteiques et traitement par ia protease 

25 

Ces protocoles ont ete decrits anterieurement (Torrent et al. ( Planta, 
180:90-95,1989). 



Microscopte 'I ctronique 
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Les corps proteiques des endospermes de type sauvage et des 
endospermes transformes par pP20y2 ont ete fixes avec 2,5% de 
paraformaldehyde dans 20 mM de tampon phosphate a pH7,2, pendant 1 
5 heure a temperature ambiante, et transformes selon la description de Geli 
et al.. 1994, Plant Cell 6, 1911-1922) toutefois en utilisant un antiserum a- 
PL et un colloide d'or et de proteine A ayant un diametre de 15 nm. Pour le 
marquage double, des sections ultra fines ont ete d'abord incubees avec 
a-PL et le colloide d'or et de proteine A (15 nm de diametre) a ete utilise 

10 pour la detection de Tanticorps. Apres lavage, des sections ont ete 
incubees avec 0,15mg/mL de proteines A pendant 20 minutes pour 
saturer les immunoglobulines et finalement les grilles ont 6te incubees 
avec des sera ct-G2 ou d-Z1 et le colloide or/proteine A (diametre 5 nm) a 
ete utilise pour la detection de I'anticorps. a-Z1 est un antiserum polyclonal 

15 de lapin dirige contre Ta-zeine obtenu selon la description de Ludevid et 
al., 1985, Plant Sci. 41, 41-48). 

Resultats 

20 Construction des v-zeines riches en lysine 

Les inventeurs avaient demontre Pimportance du domaine repete 
riche en proline ("proline-rich repeat") et du domaine C-terminal riche en 
cysteine pour la retention dans le reticulum endoplasmique de la y-zeine et 
25 la formation des corps proteiques contenant ces proteines dans les 
cellules de feuilles de Arabidoosis (Geli et al,, 1994, Plant Cell 6 f 1911- 
1922). Sur la base de ces resultats prealables, la possibility d'inserer des 
sequences riches en lysine dans differents domaines de la y-zeine, de 
fapon a creer une y-zeine modifiee correctement ciblee et accumulee dans 
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les cellules de I'endosperme a ete investiguee afin d'ameliorer I s qualites 
nutritives du mats. 

Les inventeurs ont a present construit des genes modifies de la y- 
zeine par introduction ^oligonucleotides synthetiques codant pour des 
5 sequences riches en lysine dans differents sites de la sequence codante 
de la y-zeine. Des constructions de y-zeine modifiee ont ete creees de 
facon a eviter le placement des sequences codantes riches en lysine dans 
les domaines constitues par le domaine repete en tandem et le domaine 
riche en cysteine. Des modifications de la sequence codante de la y-zeine 

10 ont ete effectuees dans la sequence correspondent au domaine Pro-X. De 
plus pour minimiser une alteration du repliement de la proteine, les 
sequences riches en lysine (P-K) n ont ete definies pour imiter la sequence 
du domaine Pro-X. Comme on le voit sur la figure 3, dans la proteine 
P20yZ, une sequence K(P-K) 4 a ete introduite apres la region Pro-X et 

15 dans la proteine H30yZ et dans la proteine H45yZ, des sequences d'acides 
amines comprenant K(P-K) 4 et 2K(P-K) 4 respectivement remplacent le 
domaine Pro-X de la y-zeine (yZ, Fig. 3). Pour explorer si I'extremite C- 
terminale etait un site permissif pour I'introduction des sequences riches 
en lysine, une proteine supplemental N13yZ, a ete creee par I'insertion 

20 d'une sequence contenant K(P-K) 4 cinq -acides amines en amont de 
I'extremite C-terminale (Fig.3). 

Activite du pro moteur de la y-zeine dans I'endosperme de mais transforms 

25 Pour determiner si des y-zeines riches en lysine pouvaient etre 

exprimees dans les cellules de I'endosperme, on a tout d'abord recherche 
un promoteur efficace et un systeme de transformation. Un promoteur de la 
y-zeine, contenant une sequence en amont de 1,7 Kb (Reina et al., 1990, 
Nucleic Acid Research, vol. 18, p. 6426) et le promoteur CaMV contenant 
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625 bp de la sequence en amont de la proteine 35S du virus de ia 
mosaique du chou-fleur CaMV ont ete testes. Jusqu'a pres nt aucune 
information n'etait disponible sur I'analyse fonctionnelle des fusions de 
genes avec le promoteur de la y-zeine dans les plantes monocotyledones 
5 transgeniques. Pour analyser I'activite et la specificite tissulaire du 
promoteur de la y-zeine, deux genes chimeres ont ete construits (voir 
Fig. 5). I'expression transitoire par bombardement biolistique (Klein et al., 
1988 PNAS 85:4305) a ete utilisee comme procedure de transformation du 
maTs pour ('analyse du promoteur aussi bien que pour des experiences 
10 d'expression des y-zeines riches en lysine. Des endospermes de mal's de 
stade 17 DAP (17 jours apres la polynisation) (le pericarpe et les cellules 
de ia couche aleuroniques etaient enleves), des embryons (17 DAP) et des 
jeunes feuilles (agees de 10 jours) ont ete bombardes avec des projectiles 
d'or recouverts avec de I'ADN plasmidique contenant les deux 
15 constructions. La figure 5 montre les activites de fJ-glucuronidase (GUS) et 
de iuciferase (LUC) presentes dans les trois tissus testes : I'endosperme, 
I'embryon et la feuille, par rapport a I'experience temoin. On doit noter que 
les resultats correspondent a la moyenne d'au moins 3 experiences 
independantes realisees. Toute I'activite GUS sous le controle du 
20 promoteur de la y-zeine etait restreinte a I'endosperme, puisque aucune 
expression GUS n'a ete detectee dans I'embryon et dans les feuilles. De 
plus, les endospermes bombardes ont ete colores histochimiquement pour 
determiner le nombre de clusters cellulaires exprimant la proteine GUS. Le 
profil de coloration corroborait les resultats precedents, GUS etait 
25 fortement exprime dans les endospermes (le nombre moyen de clusters 
cellulaires colores pour GUS par endospermes etait de 150) et des taches 
bleues n'ont pas ete detectees dans I'embryon et dans les feuilles. Au 
contraire, le promoteur CaMV 35S conferait une activite LUC dans tous les 
tissus testes (Fig. 5), mais il existait cependant des differences 
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quantitatives entre I'activite relative de I'enzyme dans les feuilles et les 
embryons par rapport a Tendosperme. Ces differ nces pouvaient 6tre 
attributes a une variability intrinseque de I'activite constitutive du 
promoteur CaMV parmi les differents tissus du mais ou a une penetration 
5 faible des particules recouvertes par I'ADN dans les cellules mesophyles 
contenant un systeme vacuolaire important. II existe dans Tetat de la 
technique des preuves selon lesquelies le promoteur CaMV aurait 
habituellement une faible activite dans des cellules de plantes 
monocotyledones (Fromm et al., 1985, Proc. Natl. Acad. Sci. USA 82, 

10 5824-5828) ; cependant les inventeurs ont montre ici une forte activite du 
promoteur CaMV dans des cellules d'endospermes. Ceci a permis de 
conclure que I'activite du promoteur de la y-zeine et du promoteur CaMV 
35S etait tres forte dans les endospermes de mais et ainsi que les deux 
promoteurs pouvaient s'averer utiles pour controler Texpression de la 

15 proteine dans ce tissu. 

Pour determiner si les proteines mutantes codees par les 
constructions etaient competentes avec la fonction de translocation 
membranaire, des experiences de transcription-traduction in-vitro ont ete 
effectuees en presence de microsomes pancreatiques de chien. Les 

20 transcrits synthetiques de chaque construction ont ete traduits et la 
translocation a travers les membranes de microsomes testee par 
recherche de la protection vis-a-vis de ia digestion par la proteinase K 
Ces resultats rapportes a ia figure 4A indiquent que les poids moleculaires 
apparents des polypeptides synthetises in-vitro refletent les mutations 

25 introduces (Fig. 4A, lignes 1, 5, 9 et 13). En presence des microsomes et 
de la proteinase K, des peptides de faible poids moleculaire n'ont pas ete 
observes, indiquant que les chaines polypeptidiques completes des y- 
zeines modifiees etaient transportees a travers les membranes 
microsomales (Fig. 4A ( lignes 3, 7, 11 et 15). En comparant le resultat de 

30 la translocation des quatre y-zeine modifiees, on a observe que la proteine 
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H45yZ (Fig. 4A, ligne 11), qui contenait I'insertion de 10 acides amines de 
type lysine, a subi une translocation inferieure aux autres proteines. il 
semble que les residus charges negativement pourraient interferer dans 
une certaine proportion, sur I'efficacite de la translocation. L'anticorps 
5 polyclonal aG2, dirige contre la y-zeine (Ludevid et al., 1985, Plant Sci. 41, 
41-48) ne pouvant etre utilise pour distinguer entre la y-zeine de type 
sauvage et les y-zeines modifiees, un anticorps aPL dirige contre un 
peptide synthetique contenant la sequence d'acides amines riche en lysine 
a ete prepare. On a ensuite teste si des proteines modifiees synthetisees 

10 in-vitro etaient reconnues a la fois par les anticorps aG2 et les anticorps 
aPL Cette experience est illustree a la figure 4B et la figure 4C ou les 
transcrits synthetiques de la y-zeine, P20yZ, H30yZ, H45yZ et N13yZ 
etaient traduits in-vitro et dans lesquels les produits de traduction etaient 
immunoprecipites avec aPL (Fig. 4B) et avec aG2 (Fig. 4C). Ces resultats 

15 indiquent que les y-zeines riches en lysine etaient reconnues par les deux 
anticorps (voir Fig. 4B et C, lignes 2 a 5) et que la y-zeine etait seulement 
reconnue par le serum aG2 (Fig. 4B et C, ligne 1). Ainsi la specificite des 
anticorps aPL pour les proteines modifiees a permis de distinguer la y- 
zeine riche en lysine de la y-zeine endogene lorsque les genes modifies 

20 etaient exprimes dans des cellules d'endosperme. Prises ensemble ces 
experiences ont montre que la presence de sequences riches en lysine ne 
perturbait pas la fonction de translocation a travers la membrane ou le 
comportement immunologique de la y-zeine. 

25 Analyse de ['expres sion des v-zeines enrichies e n lysine dans les 
endospermes de maTs 



Pour explorer si la y-zeine modifiee riche en lysine etait exprimee et 
accumulee dans les cellules de I'endosperme, des graines de stade 17 
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DAP ont ete bombardees avec I'ADN contenant les sequences codant pour 
la proteine des quatre constructions (Fig. 3) sous le controie du promot ur 
CaMV (P20yZ et H20yZ) et du promoteur de la y-zeine (H45yZ et N13yZ). 
Des constructions comprenant des promoteurs anti-sens ou depourvues de 
5 promoteurs ont ete utilisees comme temoins. Apres 24 heures de 
transformation des endospermes, les proteines totales ont 6te extraites et 
I'expression des y-zeines modifiees a ete testee par immunoblotting en 
utilisant I'anticorps aPL. La figure 6A montre que les genes chimeres de y- 
zeine contenant I'insertion (Pro-Lys) n apres le domaine Pro-X (P20yZ) ou le 

10 remplagant (H45yZ et H30yZ) etaient fortement exprimes et les produits de 
traduction accumules efficacement dans les cellules de I'endosperrne 
(Fig. 6A, lignes 3, 4 et 5). Pour chaque ligne, les extraits de proteine 
correspondaient a 1/3 environ d'un endosperme bombarde, ce qui a permis 
d'estimer que la quantite de proteines modifiees P20yZ ( H30yZ et H45yZ 

is par endosperme atteignait le niveau du nanogramme. De plus, aucune 
difference quantitative entre les niveaux depression des genes chimeres 
sous le controie du promoteur CaMV et du promoteur de la y-zeine n'a ete 
observee, confirmant les resultats ci-dessus decrits obtenus avec les 
proteines marqueurs GUS et LUC (Fig. 5). On a note que I'anticorps aPL 

20 reconnaissait une proteine d'environ 30 kD presente dans les extraits de 
proteine totale, meme dans les endospermes non transformes (voir la 
bande faible presente dans les quatre lignes de la figure 6A). Une 
procedure d'extraction sequentielle de proteine a permis de constater que 
cette proteine n*etait pas une proteine de reserve soluble dans un milieu 

25 aqueux. 

Comme le montre la figure 6A (ligne 2) aucune trace de la proteine 
N13yZ n'a ete detectee, indiquant que le gene chimere correspondant 
n'etait pas exprime dans les cellules de Tendosperrne ou que la proteine 
N13yZ etait degradee. Des ARNs des endospermes transformes avec les 
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ADNs codant pour les proteines H45yZ et N13yZ ainsi que les ARNs des 
endospermes non transformes ont ete analyses. A partir des ARNs totaux, 
les ADNc ont ete prepares et amplifies par PCR en utilisant des amorces 
specifiques. La Fig. 6B montre I'analyse en Southern blot de trois 

5 echantillons d'ADNc hybrides avec un oligonucleotide codant pour une 
sequence K(Pro-Lys) 4 utilisee comme sonde. Les resultats indiquent que le 
gene N13yZ etait correctement exprime (Fig. 6B, ligne 3). La presence de 
bandes dans les echantillons H45yZ et N13yZ, mais pas dans les 
endospermes non transformes, a suggere que la proteine N13yZ etait 

10 degradee pendant les 24 heures de I'incubation. A partir de ces 
observations, les inventeurs ont conclu que le site d'insertion des 
sequences riches en lysine etait critique pour ia stabilite de la y-zeine 
modifiee. 

15 La y-zeine enrichie en lysine est accumulee dans le corps proteiques 

En dehors de la teneur en lysine des sequences Pro-X, les 
proteines P20yZ, H30yZ et H45yZ avaient des caracteristiques communes 
avec la y-zeine de type sauvage : le peptide signal, le domaine repete en 

20 tandem N-terminal et la region riche en cysteine C-terminale etaient 
apparentees. II a semble important de determiner si ces domaines 
restaient totalement fonctionnels preservant le ciblage et la formation des 
corps proteiques ou si les sequences riches en lysine creaient un 
environnement special dans lequel ces proprietes pouvaient etre 

25 perturbees. Pour tester ceci, on a investigue si les y-zeines modifiees 
etaient capables de s'accumuler dans les corps proteiques. Un 
fractionnement sub-cellulaire a ete realise avec les endospermes 
transformes. Les homogenats d'endospermes bombardes ont ete charges 
sur des gradients de sucrose discontinus (20 %, 50 % et 70 % de sucrose) 
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et toutes les fractions collectees ont ete analysees par immunoblotting. 
P20yZ, H30yZ et H45yZ sedimentes sur la fraction des corps proteiques et 
aucune quantite significative de ces proteines n'a ete detectee, soit dans le 
surnageant, soit dans la fraction microsomale (Fig. 7A, lignes 2, 2 et 3). 
5 Bien que dans les experiences faites prealablement in-vitro (Fig. 4A) aient 
permis de constater que les y-zeines modifiees nouvellement synthetisees 
subissaient une translocation dans les microsomes canins, on a teste ici si 
les proteines modifiees exprimees in-vivo dans des cellules d'endosperme, 
subissaient une translocation dans la membrane du reticulum 

10 endoplasmique et restaient a I'interieur des corps proteiques derives du 
reticulum endoplasmique. Pour cette raison, des corps proteiques isoles 
ont ete soumis a une digestion par la proteinase K dans des tampons 
isotoniques (contenant 20 % de sucrose) ou apres un choc osmotique 
dans Teau (Fig. 7B). Les proteines protegees contre la degradation 

15 proteolytique par des enzymes peuvent etre entourees par une membrane 
et un traitement par des solutions detergeantes ou hypotoniques a resulte 
dans la digestion des proteines (Walter et Blobel, 1983, Method Enzymol. 
96 t 84-93). Une comparaison des intensites de bandes, apres digestion 
par la proteinase K dans des milieux comprenant du sucrose ou de I'eau, a 

20 permis de reveler que les proteines P20yZ. H30yZ et H45yZ etaient 
protegees de la digestion dans des tampons isotoniques (lignes 1, 3 et 5) 
mais etaient partiellement digerees dans Teau (lignes 2, 4 et 6). 

^expression des genes modifies de la y-zeine dans les cellules de 
la couche du sub-aleurone de I'endosperme par bombardement biolistique 

25 a permis d'observer que des y-zeines riches en lysines etaient largement 
accumulees a I'exception du cas ou les sequences riches en lysine etaient 
positionnees 5 residus en amont de Textremite C-terminale du polypeptide 
de la y-zeine. A partir de cette expression et d'etudes 
immunocytochimiques sur des corps proteiques isoles, les inventeurs ont 
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demontre que des y-zeines riches en lysine sont correctement accumulees 
dans ces organites et sont co-localisees avec les proteines endogen s y- 
zeine et a-zeine. 

Des corps proteiques isoles a partir des endospermes P20yZ ont ete 
5 examines par marquage de type immunogold et microscopie electronique. 
Sur des sections ultra fines incubees avec I'anticorps aP-L (Fig. 8A) t le 
marquage a Tor a ete detect e a Pinterieur des corps proteiques, indiquant 
que la proteine P20yZ riche en lysine etait accumulee a Pinterieur de ces 
organites. Dans des sections incubees seulement avec Panticorps aP-L, 
10 rimmunomarquage avait lieu uniquement sur quelques corps proteiques 
(contenant la y-zeine riche en lysine), la plus grande partie des corps 
proteiques isoles n'etait pas immunomarquee avec des anticorps aP-L 
parce qu'ils correspondaient a des cellules d'endosperme non 
transformees. Pour determiner si la y-zeine riche en lysine etait co- 
is localisee avec les polypeptides a-zeines et y-zeines, un double marquage 
en immuno-electromicroscopie a ete realise sur des corps proteiques 
isoles en utilisant les anticorps aZ et aP-L (Fig. 8B) et aG2 et aP-L 
(Fig. 8C et D). La figure 8B montre une micrographie de la section 
transversale de deux corps proteiques marques avec Panticorps aP-L 
20 (particule d'or de 15 nm) et avec I'anticorps ctZ (particule d'or de 5 nm). Le 
resultat de Timmunocoloration a montre que la proteine P20yZ est 
accumulee dans les corps proteiques et co-localisee avec Ta-zeine (voir 
Tetendue du marquage de Ta-zeine sur toute la surface du corps 
proteique). Par ailleurs, des sections tangentielles (Fig. 8B t voir les 
25 fleches) et transversales (Fig. 8D) des corps proteiques ont ete incubees 
avec Panticorps aP-L (particule d'or 15nm) et avec Panticorps aG2 
(particule d'or de 5 nm). Dans les deux cas f la proteine P20yZ etait co- 
localisee avec les polypeptides y-zeines. On a pu noter que les sections 
tangentielles du corps proteique (Fig. 8A, C, voir les fleches) etaient 
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facilement distinguees des sections transversales du corps proteique dans 
la mesure ou les premiers presentaient une densite plus forte en electrons 
et que le marquage de la y-zeine etait etendu sur toute la section. Au 
contraire les sections transversales presentaient une densite inferieure et 
5 le marquage de la y-zeine etait localise sur la membrane entourant le corps 
proteique. Dans les deux cas, la localisation du marquage de la y-zeine 
riche en lysine suivait celle de la y-zeine endogene. 

B) Preparation de plantes qenetiauement modifiees. exorimant des 
10 v- zeines riches en lysine 

1) Obtention et utilisation de cat de maYs comme cible pour ta 
transformation qenetigue. 

15 La transformation genetique du mais, quelle que soit la methode 

employee (electroporation; Aqrobacterium . microfibres, canon a particules) 
requiert generalement I'utilisation de cellules indifferenciees en divisions 
rapides ayant conserve une aptitude a la regeneration de plantes entieres. 
Ce type de cellules compose le cal friable embryogene (dit de type II) de 

20 mais. 

Ces cals sont obtenus a partir d'embryons immatures de genotype 
Hi II ou (A188 x B73) selon la methode et sur les milieux decrits par 
Armstrong (Maize Handbook ; (1994) M. Freeling, V. Walbot Eds ; pp 665- 
671). Les cals ainsi obtenus sont multiplies et maintenus par repiquages 
25 successifs tous les quinze jours sur le milieu d'initiation. 

Des plantules sont ensuite regenerees a partir de ces cals en 
modifiant I'equilibre hormonal et osmotique des cellules selon la methode 
decrite par Vain et al. (Plant Cell Tissue and Organ Culture (1989 18 : 143- 
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151). Ces plantes sont ensuite acclimatees en serre ou elles peuvent etre 
croisees ou autofecondees. 

2) Utilisation du canon a particules pour la transformation 
5 qenetique du mats. 

Le paragraphe precedent decrit I'obtention et la regeneration des 
lignees cellulaires necessaires a la transformation ; on decrit ici une 
methode de transformation genetique conduisant a Integration stable des 

10 genes modifies dans le genome de la plante. Cette methode repose sur 
('utilisation d'un canon a particules identique a celui decrit par J. Finer 
(Plant Cell Report (1992) 11 :323-328) ; les cellules cibles sont des 
fragments des cals decrits dans le paragraphe 1. Ces fragments d'une 
surface de 10 a 20 mm 2 ont ete disposes, 4h avant bombardement, a 

15 raison de 16 fragments par boite au centre d'une boite de petri contenant 
un milieu de culture identique au milieu ^initiation, additionne de 0,2 M de 
mannitol + 0,2M de sorbitol. Les plasmides portant les genes a introduire 
sont purifies sur colonne Qiagen® en suivant les instructions du fabricant. 
lis sont ensuite precipites sur des particules de tungsten (M10) en suivant 

20 le protocole decrit par Klein (Nature (1987) 327 :70-73). Les particules 
ainsi enrobees sont projetees vers les cellules cibles a I'aide du canon et 
selon le protocole decrits par J. Finer (Plant Cell Report (1992) 11 : 323- 
328). 

Les boites de cals ainsi bombardees sont ensuite scellees a I'aide 
25 de Scellofrais® puis cultivees a I'obscurite a 27°C. Le premier repiquage a 
lieu 24h apres, puis tous les quinze -jours pendant 3 mois sur milieu 
identique au milieu d'initiation additionne d'un agent selectif dont la nature 
et la concentration peuvent varier selon le gene utilise (voir paragraphe 3). . 
Les agents selectifs utilisables consistent generalement en composes 
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actifs de certains herbicides (Basta®, Round up®) ou certains 
antibiotiques (Hygromycine, Kanamycine...). 

On obtient apres 3 mois ou parfois plus tot, des cals dont ta 
croissance n'est pas inhibee par Tagent selectif, habituellement et 
5 majoritairement composes de cellules resultant de la division d'une cellule 
ayant integre dans son patrimoine genetique une ou plusieurs copies du 
gene de selection. La frequence d'obtention de tels cals est d'environ 0,8 
cal par boite bombardee. 

Ces cals sont identifies, individualises, amplifies puis cultives de 
10 fagon a regenerer des plantules (cf. paragraphe 1). Afin d'eviter toute 
interference avec des cellules non transformees toutes ces operations sont 
menees sur des milieux de culture contenant I'agent selectif. 

Les plantes ainsi regenerees sont acclimatees puis cultivees en 
serre ou elles peuvent etre croisees ou autofecondees. 

15 

3) Utilisation du gene Bar pour ta production plantes de mais 
qenetiauement modifiees avant incorpore et exprimant le gene H45vZ 

Le gene Bar de Streptomyces hygroscopicus code pour une 
20 phosphinotricine acyl transferase (PAT) qui inactive par acetylation la 
phosphinotricine-molecule active de I'herbicide Basta®. Les cellules 
portant ce gene sont done rendues resistantes a cet herbicide et peuvent 
etre selectionnees par son intermediaire. Pour la transformation des 
cereales la sequence codante du gene Bar est sous le controle d'une 
25 region regulatrice permettant une expression forte et constitutive dans les 
cellules vegetales. Une telle region est avantageusement constitute par le 
promoteur et le premier intron du gene Actine de riz tels que decrits par Mc 
Elroy (MoL Gen. Genet. (1991) 231 : 150-160). 

Ce gene chimerique est clone sur un plasmide permettant son 
30 amplification par Escherischi a coli. Ce plasmide (pDM 302 Cao (Plant Cell 
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Report (1992) 11:586-591) apres amplification puis purification sur colonne 
Qiagen® peut etre utilise en transformation genetique du mais en utilisant 
par exemple ia methode decrite dans Pexemple precedent. Dans ce cas les 
milieux de culture destines a la selection des cellules transformees sont 
5 additionnes de 2 mg/L de Phosphinotricine. 

Pour ('introduction du gene H45yZ une technique dite de 
cotransformation est avantageusement utilisee : les genes de selection 
(Bar) et d'interet (H45yZ) sont portes par des plasmides independants. 
Dans le cas de I'utilisation du canon a particules on procede a une 

10 coprecipitation des plasmides sur les particules de tungstene, la quantite 
totale d'ADN precipite sur les particules restant identique a ce qu'elle est 
dans le protocole standard (5pg d'ADN pour 2,5 mg de particules) chaque 
plasmide representant environ la moitie du poids total d'ADN utilise. 

L'experience montre qu'avec cette methode, la cointegration des 

15 plasmides dans les cellules vegetales est Tevenement le plus frequent (de 
Tordre de 90%) c'est-a-dire que pratiquement chaque plante ayant integre 
le gene Bar et selectionnee par son intermediate portera aussi le gene 
H45yZ. Le niveau de coexpression (pourcentage de plantes seiectionnees 
exprimant le gene H45yZ) est habituellement de Pordre de 70%. 

20 Les genes ainsi introduits sont generalement lies au sens 

genetique, le gene H45yZ peut ainsi avantageusement etre suivi dans les 
descendances grace a la resistance a Therbicide qui lui est etroitement 
associee. 

La quantite de proteine modifiee est determinee par les methodes 
25 decrites dans I'exemple A, notamment par immunoblotting sur extraits 
proteiques de grains de mais immatures ou matures, preleves en pool sur 
des plantes resistantes au BASTA. 
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4) Exemple explicitant I'etape d'introduction des transqenes t 
notamment du qfene codant pour la H45yZ. pour la modificati n du 
phenotype opaque 2 du mats 

5 Amelioration de mais opaque 2 par retrogression du gene de y- 

zeine enrichie en lysine. 

Les plantes transformees decrites dans Texemple precedent, 
presentent a la fois, une resistance au Basta et I'expression d'une y-zeine 
enrichie en lysine sont utilisees. Leur pollen est utilise pour feconder les 

10 plantes de mais opaque 2 de la lignee W64Ao2 qui ne contient qu'un seul 
gene y-zeine. Cette lignee a ete obtenue aupres du Maize Stock Center. 
Les plantes de ces descendances F1 sont selectionnees pour leur 
resistance au Basta et ensuite autofecondees. Les graines F2 ainsi 
produites sont analysees pour le phenotype opaque sur une table 

15 lumineuse et les grains opaque ou les grains vitreux sont semes et evalues 
pour la resistance au Basta. Dans le cas ou tous les grains opaques sont 
sensibles au Basta, la demonstration est faite que ('introduction du gene y- 
zeine enrichi en lysine dans la plante consideree, complemente le 
phenotype opaque 2. 

20 Dans un deuxieme temps, dans ces plantes Basta resistantes, des 

individus de genotype o2/o2 et ne possedant qu'un seul gene y-zeine sur 
le chromosome 7 sont selectionnes a Taide des sondes moleculaires 
codant pour le gene Opaque 2 et pour la y-zeine. Ces dernieres revelant 
des fragments de restriction polymorphes et seuls les individus presentant 

25 le patron type de la lignee W64o2 sont retenus (Lopes M.A. et al, 1995, 
Mol. Gen. Genet. 19 : 247, 603-613). 

Ces individus ont un contenu en lysine qui est en moyenne 
equivalent a celui du mats o2, voire superieur. A partir de ces individus, 
toute introgression dans des varietes ELITE, du caractere « forte teneur en 
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lysine » est observee via la determination de la resistance au BASTA t de 
la presence de I'allele o2 detectee par RFLP. 

5) Expression de v-zeines enrichies en lysine dans Arabidopsis 
thaliana. 

5 

Pour obtenir une transformation stable, ies constructions 
plasmidiques P20yZ et pH30yZ donees dans le plasmide Bluescript KS (-) 
ont ete inserees sous forme de fragments Hincll/Xbal dans le vecteur 
binaire pBin19 (Bevan, M. Nucl. Acids Res. 12: 8711-8721 (1984)), 

10 contenant le promoteur 35S du virus de la mosaTque du chou-fleur (CaMV) 
et Ies signaux de formation de I'extremite 3' et de polyadenylation du gene 
de I'octopine synthetase (ocs). Les nouveaux plasmides ont ete appeles 
p1 9P20yZ et p1 9H30yZ (figure 12). 

Les vecteurs binaires contenant les sequences codant pour les 

15 proteines P20yZ et H30yZ (p1 9P20yZ et p1 9H30yZ) ont ete transferes dans 
la souche LBA4404 d' Aorobacterium tumefaciens . Des plantes Arabidopsis 
ecotype RLD ont ete transformees selon la methode decrite par Valvekens, 
D., Van Montagu, M. et Van Lijsebettens, M. ((1988) Proc. 
Natl.Acad.Sci.USA 85: 5536-5540). Pour chaque construction, 10 plantes 

20 transgeniques ont ete criblees par analyse en immunoblot en utilisant un 
antiserum obtenu contre la v-zeine (aG2. Ludevid et al. 1985). Les plantes 
contenant les taux les plus eleves (correspondant a environ 0,1% de la 
quantite totale de proteines presentes dans les feuilles d' Arabidopsis) de 
produits transgeniques dans la generation F1 , ont ete choisies pour obtenir 

25 la generation F2. Ces plantes ont egalement ete selectionnees pour 
I'expression de la proteine recherchee. 

Des plantes transgeniques entieres, selectionnees dans un milieu 
contenant de la kanamycine ont ete homogeneisees dans de I'azote 
liquide. Les proteines transgeniques ont ete extraites selectivement avec 
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une solution contenant de I'ethanol/ de I'acide chlorhydrique HCI 0,125 N 
dans une proportion de 3:1 (v/v), avec 5% de mercaptoethanol , et 
d'inhibiteurs de protease. Les proteines extraites avec cette solution ont 
ete precipitees dans 5 volumes d'acetone et analysees par SDS-PAGE et 
5 immunoblotting. Les extraits proteiques des plantes non transgeniques ont 
ete utilises en tant que controles. Les proteines resultant de I'insertion des 
sequences K(P-K) 4 dans la y-zeine ont ete exprimees correctement dans 
les plantes Arabidopsis thaliana en utilisant le promoteur constitutif 35 S 
du CaMV. Sur les immunoblots, les anticorps aG2 et aPL reconnaissaient 

10 des bandes d'electrophorese correspondant aux proteines P20yZ et 
H30yZ. Ces bandes migraient avec des masses moleculaires apparentes 
conformes a celles que Ton avait prealablement observees dans les 
experiences de traduction / translocation in vitro (30 kD et 26 kD 
respectivement). Comme on I'a observe dans les plantes transgeniques 

15 Arabidopsis exprimant la y-zeine (Geli et al. Plant Cell 6: 1911-1922 
(1994)), les proteines P20yZ et H30yZ migraient sous la forme de deux 
bandes d'electrophorese a savoir les bandes correspondant a 36 et 30 kD 
pour P20yZ et les bandes correspondant a 32 et 26 kD pour H30yZ. Les 
bandes superieures pourraient correspondre a des produits ayant subi des 

20 modifications post-traductionnelles. Une telle modification post- 
traductionnelle n'a pas ete detectee dans les endospermes de mats 
transformes. Ce resultat suggere que la modification apparaitrait lorsque 
ces proteines sont exprimees dans un systeme heterologue tel que 
Arabidopsis thaliana . 

25 

6) Expression de v-ze ines enrichies en lysine recombtnantes dans 
du maTs 

Methode: 
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Apres obtention des plantes transgeniques, on les croise avec une 
lignee male non transformee. Par consequent, 50% des grains recoltes 
dans ce cas d'insertion unilocus seront transgeniques. Dans le but 
d'analyser les y-zeines enrichies en lysine dans les plantes transgeniques, 
5 les proteines sont extraites a partir de 6 grains par transformant. 

Les endospermes sont disseques en retirant des grains les 
embryons et les pericarpes. Les endospermes sont broyes et 50mg de 
farine sont utilises pour I'extraction selective. Prealablement les a-zeines 
sont extraites par 3 traitements avec de Tethanol a 70%. Apres 
10 centrifugation, I'ethanol est evapore sous vide. Les proteines insolubles 
dans I'ethanol (principalement les y-zeines et les y-zeines enrichies en 
lysine) sont extraites du culot avec un tampon de Laemli contenant 10% de 
mercaptoethanol (100fjl de tampon pour 10mg de farine). Les proteines 
totales sont ensuite analysees par coloration a Targent (Morrissey J.H. 
15 1981, Ann. Biochem. vol. 117, p. 307-310). Les y-zeines et les y-zeines 
enrichies en lysine sont analysees par immunoblot en utilisant 
respectivement les anticorps aG2 (dilution 1/2000) et ctPL (dilution 1/500). 
Un anticorps anti-anticorps de lapin conjugue a la phosphatase alcaline est 
utilise comme anticorps secondare dans I'immunoblot. Les extraits sont 
20 dilues pour permettre leur chargement sur SDS-page selon la methode 
analytique utilisee. 
Resultats : 

Accumulation d e v-zeines enrichies en lysine dans les plants 20vZ et 45yZ 
de mais transqeniaue : 
25 La figure 13 montre Timmunoblot des extraits de proteines revelees 

avec I'antiserum aPL (A,B) et les proteines totales apres coloration a 
I'argent (C). Comme on peut le voir dans la fig13A, 6 plants transgeniques 
20y2 ont ete testes avec I'antiserum aPL et de la y-zeine enrichie en lysine 
est exprimee dans les plants transgeniques A1 et D2 (pistes 1 et 6 
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respectivement). Dans le plant C1 (piste 4), seules des traces de y-zeine 
enrichie en lysine sont observees. Lorsque les extraits des 6 plants 
transgeniques 45yZ sont charges sur le gel et marques avec ('antiserum 
aPL (Fig 13B), on observe une forte reaction avec I'anticorps pour les 
5 transformants B1 et C1 (pistes 3 et 4 respectivement). 

II faut noter que la reaction avec I'anticorps dans les extraits des 
endospermes des plants 45yZ B1 et C1 est plus forte que dans les extraits 
des plants 20yZ A1 et D2. Ce resultat est confirme apres coloration a 
I'argent des gels (fig 13C) ou les 2 types de y-zeine : endogene et enrichie 

10 en lysine, sont colores. La y-zeine enrichie en lysine a un poids 
moleculaire apparent de 30kDa et la y-zeine endogene de 28 kDa. 
L'expression de y-zeine enrichie en lysine est plus faible dans le plant 45yZ 
B1 que dans C1 (pistes 2 et 3 respectivement). Dans les pistes 1 a 6 de la 
Fig 13C, une dilution identique des proteines des extraits d'endospermes 

15 des plants 45yZ et 20yZ avait ete deposee sur le gel. A cette dilution 
l'expression de la y-zeine enrichie en lysine est detectee seulement dans 
les endospermes des plants 45yZ B1 et C1. Cependant, lorsqu'un extrait 
plus important (40 ug de proteines par piste) de proteines 20yZ est charge 
sur le gel, une legere bande (voir fleche), correspondant a la y-zeine 

20 enrichie en lysine est detectee dans les endospermes 20yZ A1 et D2. Ce 
resultat indique que les plants 45yZ accumulent beaucoup plus de 
proteines selon I'invention que les plants 20yZ. Ceci est probablement du 
aux activites differentes des promoteurs. Les plantes 45yZ ont ete 
transformees avec un construit contenant la y-zeine selon I'invention sous 

25 le controle du promoteur y-zeine (1 ,7 kb) alors que les plantes 20yZ ont ete 
transformees avec la meme sequence codante mais sous le controle du 
promoteur 35S du CaMV. La coloration a I'argent est une technique 
generate de coloration des proteines. mais la forte couleur marron est 
surtout observee en presence de proteines basiques. Comme les a-zeines 
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ont ete extraites de la farine, comme decrit ci-dessus, elles sont absentes 

lors de I'analyse en SDS-page de la Fig 13C. 

Segregation 

Puisqu'il y a une segregation de 50% dans toutes les plantes 
5 transgeniques dans le cas d'un locus unique, une analyse grain par grain a 
ete realisee afin de quantifier le taux de y-zeine enrichie en lysine dans 
chaque transformant. L'analyse a ete faite uniquement avec les 
transformants 45yZ qui presentaient le plus fort taux d'expression. La 
figure 14 montre la coloration a Targent (A) et I'immunoblot avec aPL (B) 

10 de 5 endospermes differents de 45yZ B1 et C1. La faible bande 
electrophoretique presente sur toutes les pistes (voir A, pistes 1 a 10) 
correspond a la y-zeine endogene. Comme on peut le voir dans la Fig 14 
A, 2 des 5 endospermes accumulent de la y-zeine enrichie en lysine (voir 
pistes 3,4 et 6,7). Par consequent, environ ta moitie des graines 

15 accumulent des quantites significatives de proteines enrichies en lysine. Si 
on prend en compte le fait que des quantites identiques de proteines ont 
ete deposees sur le gel, on observe que le transformant 45yZ C1 a 
accumule plus de y-zeine enrichie en lysine que B1. En fait ce resultat est 
en accord avec ce qui est observe lors de la coloration a I'argent des 

20 extraits melanges d'endosperme (Fig 13 C, piste 3). La preuve de la 
presence de y-zeine enrichie en lysine dans ces endospermes est 
souiignee dans la Fig 14B. L'immunoblot utilisant I'antiserum aPL montre 
que 2 extraits d'endosperme de 45yZ B1 (pistes 3 et 4) et de 45yZ C1 
(pistes 4 et 6) accumulent de la y-zeine enrichie en lysine. Pour confirmer 

25 le pourcentage de grains transgeniques, 10 nouveaux grains de 
transformant 45yZ C1 ont ete analyses par immunoblot et utilisation de 
I'antiserum aPL (Fig 15 A). Comme attendu, environ la moitie des graines 
transgeniques sont detectees. Une bande immunoreactive a ete observee 
dans les extraits d'endosperme (pistes 1,2,9 et 10). 
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Estimation de la quantite de v-zeines enrichies en lysine dans les 
endospermes des transformants 45yZ : 

aG2 est un antiserum polyclonal qui reconnait les y-zeines 
5 endogenes et les y-zeines enrichies en lysine. La reactivite de cet 
antiserum avec les extraits d'endospermes 45yZ C1 a ete utilisee pour 
quantifier la quantite de y-zeines enrichies en lysine selon I'invention dans 
les endospermes des plantes transformees. 

La figure 15B presente I'immunoblot de 5 extraits proteiques 
10 correspondant a 5 endospermes 45yZ C1 (pistes 1 a 5). Comme attendu, 
seuls 2 endospermes ont presente un profil d'immunoreaction 
caracteristique des grains transgeniques. La bande superieure de 30 kDa 
correspond a la y-zeine enrichie en lysine (fleches sur piste 1 et 2) et la 
bande inferieure a la y-zeine endogene. On peut noter que dans les 
is endospermes des plantes non transgeniques la bande superieure est 
absente (pistes 3, 4 et 5). De fa?on surprenante, il semble que le contenu 
en y-zeine endogene est plus faible dans les extraits des plantes 
transgeniques que dans les plantes non-transgeniques (voir fleches sur 
pistes 1 et 5). 
20 A premiere vue, il a ete observe que : 

i) dans les endospermes transgeniques 45yZ C1, le ratio y-zeine enrichie 
en lysine/y-zeine endogene est de 7/3. Done la quantite de proteine 
modifiee selon Tinvention est au moins 2 fois plus importante que celle 
de la proteine endogene, 
25 ii) la quantite de y-zeine endogene dans les endospermes non 
transgeniques (voir pistes 3, 4 et 5) etait equivalente a celle de la y- 
zeine enrichie en lysine dans les endospermes transgeniques (voir 
pistes 1 et 2). 
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7) Expression de v-zeines enrichies en lysine recombinantes dans 
du ble 

Comme dans I'exemple 6) t it est possible de montrer la presence de 
y-zeines enrichies en lysine selon I'invention dans du ble. 
5 La transformation du ble peut notamment etre effectuee selon la methode 
decrite dans Weeks et ai., 1993, Plant Physiol., vol 102 : pages 1077-1084 
ou selon la methode decrite dans EP709462. 
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REVENPICATIONS 

1. Oligonucleotide comprenant au moins un enchainement 
5 codant pour un polypeptide repondant a la formule (P-K) n , dans laquelle : 

- n est un entier superieur ou egal a 2, 

- P represente un residu d'acide amine Proline, 

- K represente un residu decide amine Lysine, 

- le signe "-" symbolise une liaison entre les deux residus d'acides 
10 amines et notamment une liaison de type peptidique, les n unites (P-K) 

etant liees entre elles egalement par de telles liaisons par exemple de type 
peptidique. 

2. Oligonucleotide selon la revendication 1, comprenant un 
enchainement codant pour un polypeptide repondant a la formule (P-K) n , 

15 dans laquelle n est un entier superieur ou egal a 3, de preference n est 
egal a 4, 5, 6, 7, 8, 9, 10 ou 15. 

3. Oligonucleotide selon ia revendication 1 ou la 
revendication 2, comprenant un enchainement codant pour un polypeptide 
repondant a la formule (P-K) n , dans laquelle la sequence des n unites (P- 

20 K) est interrompue par un ou plusieurs residus d'acides amines differents 
des residus P ou K. 

4. Oligonucleotide selon Tune quelconque des revendications 1 
a 3, caracterise en ce que I'enchainement codant pour le polypeptide 
comprenant les n unites (P-K) est complete a son extremite 5 1 et/ou a son 

25 extremite 3\ par un ou plusieurs codons, codant par exemple pour au 
moins un residu lysine a I'extremite N-terminale du polypeptide forme. 

5. Oligonucleotide selon la revendication 4, caracterise en ce 
qu'il code pour un polypeptide repondant a la formule (P-K), a la formule 
K(P-K) 4 ou a la formule 2K (P-K) 4 . 
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6. Sequence de nucleotides recombinant comprenant un 
enchainement de nucleotides codant pour une proteine de reserve d'une 
plante, caracterisee en ce qu'elle comprend en outre un oligonucleotide 
selon Tune quelconque des revendications 1 a 5, insere en un site de 

5 I'enchainement de nucleotides choisi de fagon telle que : 

- ('expression de la sequence de nucleotides dans une cellule 
vegetale determinee permet d'obtenir une proteine de reserve modifiee 
localisee de iagon identique ou similaire a la proteine de reserve normale 
qui serait exprimee dans les memes conditions dans la meme cellule par 

10 I'enchainement de nucleotides codant correspondant, et/ou 

- la proteine de reserve modifiee codee par la sequence de 
nucleotides recombinant est reconnue immunologiquernent par des 
anticorps produits contre la proteine de reserve normale correspondante. 

7. Sequence de nucleotides selon la revendication 6, 
is caracterisee en ce que I'enchainement de nucleotides codant qu'elle 

comprend, code pour une proteine de reserve naturellement pauvre en 
lysine. 

8. Sequence de nucleotides selon la revendication 7, 
caracterisee en ce que I'enchainement de nucleotides codant qu'elle 

20 comprend, code pour une proteine de reserve naturellement produite par 
une plante utilisable en alimentation animale ou humaine. 

9. Sequence de nucleotides selon la revendication 7 ou la 
revendication 8, caracterisee en ce que I'enchainement de nucleotides 
codant quelle comprend, code pour une proteine de reserve naturellement 

25 produite par une plante de la famille des cereales. 

10. Sequence de nucleotides . selon la revendication 7 ou la 
revendication 8, caracterisee en ce que I'enchainement de nucleotides 
codant qu'elle comprend, code pour une proteine de reserve naturellement 
produite par une plante de la famille des legumineuses ou des cruciferes. 
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11. Sequence de nucleotides selon la r vendication 9, 
caracterisee en ce que I'enchainement de nucleotides codant qu'elle 
comprend, code pour une proteine de reserve de maTs. 

12. Sequence de nucleotides selon la revendication 11, 
5 caracterisee en ce que renchainement de nucleotides codant qu'elle 

comprend, code pour une proteine de reserve de la famille des zeines. 

13. Sequence de nucleotides selon la revendication 12, 
caracterisee en ce que renchainement de nucleotides codant qu'elle 
comprend, code pour une proteine de reserve est la y-zeine du maTs. 

io 14. Sequence de nucleotides selon la revendication 13, 

caracterisee en ce que renchainement de nucleotides codant pour la y- 
zeine de maTs qu'elie contient, repond a la sequence presentee sur la 
Figure 9. 

15. Sequence de nucleotides selon la revendication 7 ou la 
15 revendication 8, caracterisee en ce que I'enchainement de nucleotides 

codant qu'elle comprend, code pour une proteine de reserve d'une plante 
choisie parmi les suivantes : le soja, le tournesol, le tabac, le ble, I'avoine, 
la luzerne, le riz, le colza, Arabidoosis thaliana . 

16. Sequence de nucleotides selon la revendication 13 ou la 
20 revendication 14, caracterisee en ce que Toligonucleotide est insere a la 

place ou a la suite du domaine Pro-X ou dans le domaine Pro-X 
naturellement present dans la y-zeine du maTs. 

17. Sequence de nucleotides recombinante, caracterisee en ce 
qu'elle comprend une sequence de nucleotides selon Tune quelconque des 

25 revendications 1 a 16 sous le controle d'un promoteur ^expression. 

18. Sequence de nucleotides recombinante selon la 
revendication 17, caracterisee en ce que le promoteur est un promoteur 
specifique d'un tissu cellulaire determine, par exemple un promoteur 
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specifique pour I'expression dans les graines, et/ou dans les feuilles de 
plantes. 

19. Sequence de nucleotides seion la revendication 17, 
caracterisee en ce que le promoteur d'expression est celui de la y-zeine du 

5 mais. 

20. Sequence de nucleotides selon ta revendication 17, 
caracterisee en ce que le promoteur d'expression est le promoteur 
CaMV35S. 

21. Sequence de nucleotides selon Tune quelconque des 
10 revendications 16 a 20, caracterisee en ce qu'elle code pour Tun des 

polypeptides P20yZ ou PH45yZ repondant respectivement aux sequences 
representees aux figures 11 et 10. 

22. Vecteur de clonage et/ou d'expression, caracterise en ce qu'il 
comprend, en un site non essentiel pour sa replication, une sequence de 

15 nucleotides selon I'une quelconque des revendications 1 a 20. 

23. Vecteur de clonage et/ou d'expression, caracterise en ce qu'il 
s'agit de Tun des plasmides pP20yZ (CNCM N°l-1640), pH30yZ ou pH45yZ 
(CNCM N° 1-1639). 

24. Polypeptide code par une sequence selon Tune quelconque 
20 des revendications 1 a 17. 

25. y-zeine de mais modifiee riche en lysine, caracterisee en ce 
qu'elle est codee par la sequence de nucleotides selon la revendication 13 
ou 14. 

26. y-zeine de mais modifiee enrichie en lysine, caracterisee en 
25 ce que sa sequence d'acides amines est modifiee par au moins un 

polypeptide repondant a la formule (P-K) n ou a la formule K-(P-K) n , dans 
laquelle : 

- n est un entier superieur ou egal a 2, 

- P represente un residu d'acide amine Proline, 
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- K represente un residu d'acide amine Lysine, 

- le signe "-" symbolise une liaison entre les deux residus d'acides 
amines et notamment une liaison de type peptidique, les n unites (P-K) 
etant liees par des liaisons notamment de type peptidique, ledit 

5 polypeptide repondant a la formule (P-K) n ou K-(P-K) n etant substitue a une 
sequence naturellement presente dans la y-zeine de mais normale ou 
etant insere avec deletion d'un ou plusieurs acides amines de la sequence 
d'acides amines de la y-zeine de mais normale, ou etant ajoute dans la 
sequence d'acides amines de la y-zeine normale, le site d'insertion du 
10 polypeptide etant choisi de fa?on telle que : 

- lorsque ia y-zeine modifiee riche en lysine est produite dans une 
cellule hote, notamment une cellule vegetale, elle est localisee de fa$on 
identique ou similaire a la y-zeine de mais normale qui serait produite dans 
les memes conditions, dans la meme cellule hote, et/ou, 

15 - la y-zeine de mais modifiee est reconnue par des anticorps diriges 

contre la y-zeine normale de mais. 

27. y-zeine de mais modifiee selon la revendication 26, 

caracterisee en ce qu'il s'agit de la proteine P20y2 ou de la proteine H30yZ 

ou de la proteine H45yZ. 
20 28- Cellule hote recombinante, caracterisee en ce qu'elle 

comprend une sequence de nucleotides selon Tune quelconque des 

revendications 1 a 17. 

29. Cellule hote selon la revendication 28, caracterisee en ce 
qu'il s'agit d'une bacterie, par exemple de E. coli ou Agrobacterium 

25 tumefaciens . 

30. Cellule hote selon la revendication 28, caracterisee en ce 
qu'il s'agit d'une cellule vegetale. 

31. Cellule hote selon la revendication 30, caracterisee en ce 
qu'il s'agit d'une cellule de semences de plante. 
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32. Cellule hote selon ia revendication 31, caracterisee en ce 
qu'il s'agit d'une cellule d'endosperme de semences de mais. 

33. Cellule hote selon la revendication 32, caracterisee en ce 
qu'elie contient, integree de fa?on stable dans son genome, une sequence 

5 de nucleotides selon la revendication 13. 

34. Cellule hote selon la revendication 32, caracterisee en ce 
qu'elie produit une y-zeine de mais modifiee riche en lysine selon la 
revendication 26 ou la revendication 27. 

35. Cellule hote selon la revendication 30, caracterisee en ce 
10 qu'il s'agit d'une cellule de soja, de tournesol, de tabac, de ble, d'avoine, 

de luzerne, de riz, de colza, d' Arabidoosis . 

36. Semences produisant un polypeptide selon Tune quelconque 
des revendications 24 a 27. 

37. Plante produisant un polypeptide selon Tune quelconque des 
15 revendications 24 a 27. 

38. Plante selon la revendication 37, caracterisee en ce qu'il 
s'agit du ma'/s. 

39. Semences obtenues a partir des plantes selon la 
revendication 37 ou 38. 

20 40. Procede de production de plantes ou de semences exprimant 

une proteine de reserve modifiee, caracterise en ce qu'il comprend les 
etapes de : 

a) transformation d'une cellule vegetale, avec une sequence de 
nucleotides selon I'une quelconque des revendications 6 a 17, ou un 
25 vecteur selon I'une quelconque des revendications 22 ou 23. dans des 
conditions permettant I'expression de fa?on stable et fonctionnelle de la 
proteine de reserve modifiee codee par la sequence de nucleotides; 
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b) regeneration de plantes a partir de la cellule de plante 
transformee de I'etape a) ( pour obtenir des plantes exprimant la proteine 
de reserve modifiee, 

c) le cas echeant, obtention de semences a partir des plantes 
5 modifiees obtenues a I'etape b). 

41. Procede selon la revendication 40, caracterise en ce que la 
plante est le mais et la proteine de reserve est la y-zeine. 
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FEUILUE DE REPLACEMENT (RE6LE 26) 
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FIGURE 4 




FIGURE 6 
FEUILLE DE REMPLACEMENT (REGLE 26) 
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FIGURE 7 



FEUILLE DE REMPLACEMENT (REGLE 26) 
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FEUILLE DE REMPLACEMENT (REGLE 26) 
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y-Zeine 



10 20 30 40 50 60 

I I I I I | 

ATGAGGGTGTTGCTCGTTGCCCTCGCTCTCCTGGCTCTCGCTGCGAGCGCCACCTCCACG 
MRVLLVALALLALAASATST 
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CCGCCGCCGGTCCACCTGCCACCGCCGGTCCATGTGCCGCCGCCGGTTCATCTGCCGCCG 
PPPVHLPPPVHVPPPVHLPP 
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CCACCATGCCACTACCCTACTCAACCGCCCCGGCCTCAGCCTCATCCCCAGCCACACCCA 
PPCH YPTQPPRPQPHPQPHP 
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TGCCCGTGCCAACAGCCGCATCCAAGCCCGTGCCAGCTGCAGGGAACCTGCGGCGTTGGC 
CPCQQPHPSPCQLQGTCGVG 
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AGCACCCCGATCCTGGGCCAGTGCGTCGAGTTTCTGAGGCATCAGTGCAGCCCGACGGCG 
STPILGQCVEFLRHQCSPTA 
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ACGCCCTACTGCTCGCCTCAGTGCCAGTCGTTGCGGCAGCAGTGTTGCCAGCAGCTCAGG 
TPYCSPQCQSLRQQCCQQLR 
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CAGGTGGAGCCGCAGCACCGGTACCAGGCGATCTTCGGCTTGGTCCTCCAGTCCATCCTG 
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CAGCAGCAGCCGCAAAGCGGCCAGGTCGCGGGGCTGTTGGCGGCGCAGATAGCGCAGCAA 
QQQPQSGQVAGLLAAQIAQQ 
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ATGAGGGTGTTGCTCGTTGCCCTCGCTCTCCTGGCTCTCGCTGCGAGCGCCACCTCCACG 
MRVLLVALALLALAASATST 

70 80 90 100 110 120 

I I I I I I 

CATACAAGCGGCGGCTGCGGCTGCCAGCCACCGCCGCCGGTTCATCTACCGCCGCCGGTG 
HTSGGCGCQPPPPVHLPPPV 

130 140 150 160 170 180 

I I I I I I 

CATCTGCCACCTCCGGTTCACCTGCCACCTCCGGTGCATCTCCCACCGCCGGTCCACCTG 
HLPPPVHLPPPVHLPPPVHL 

190 200 210 220 230 240 

I I I I I I 

CCGCCGCCGGTCCACCTGCCACCGCCGGTCCATGTGCCGCCGCCGGTTCATCTGCCGCCG 
PPPVHLPPPVHVPPPVHLPP 

250 260 270 280 290 300 

I I I I I I 

CCACCATGCCACTACCCTACTCAACCGCCCCGGATCGAATTCAAACCAAAGCCAAAGCCG 
PPCHYPTQPP. RIEFKPKPKP 

310 320 330 340 350 360 

I I I I I I 

AAGCCAAAAGAATTCAAACCAAAGCCAAAGCCGAAGCCAAAAGAATTCCTGCAGCCCCTG 
KPKEFKPKPKPKPKEFLQPL 

370 380 390 400 410 420 

I I I I I I 

CAGGGAACCTGCGGCGTTGGCAGCACCCCGATCCTGGGCCAGTGCGTCGAGTTTCTGAGG 
QGTCGV GSTPILGQCVEFLR 

430 440 450 460 470 480 
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CATCAGTGCAGCCCGACGGCGACGCCCTACTGCTCGCCTCAGTGCCAGTCGTTGCGGCAG 
HQCSPTATPYCSPQCQSLRQ 
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I I I I I I 

CAGTGTTGCCAGCAGCTCAGGCAGGTGGAGCCGCAGCACCGGTACCAGGCGATCTTCGGC 
QCCQQLRQVEP. QHRYQAI F G 
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TTGGTCCTCCAGTCCATCCTGCAGCAGCAGCCGCAAAGCGGCCAGGTCGCGGGGCTGTTG 
LVLQSILQQQPQSGQVAGLL 
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GCGGCGCAGATAGCGCAGCAACTGACGGCGATGTGCGGCCTGCAGCAGCCGACTCCATGC 
AAQIAQQLTAMCGLQQPTPC 
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ATGAGGGTGTTGCTCGTTGCCCTCGCTCTCCTGGCTCTCGCTGCGAGCGCCACCTCCACG 
MRVLLVALALLALAASATST 
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CATACAAGCGGCGGCTGCGGCTGCCAGCCACCGCCGCCGGTTCATCTACCGCCGCCGGTG 
HTSGGCGCQPPPPVHLPPPV 
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CATCTGCCACCTCCGGTTCACCTGCCACCTCCGGTGCATCTCCCACCGCCGGTCCACCTG 
HL P P PVH L P P PVHL P P PVH L 

190 200 210 220 230 240 

I I I I I I 

CCGCCGCCGGTCCACCTGCCACCGCCGGTCCATGTGCCGCCGCCGGTTCATCTGCCGCCG 
PPPVHLPPPVHVPPPVHLPP 

250 260 270 280 290 300 

I I I I I I 

CCACCATGCCACTACCCTACTCAACCGCCCCGGCCTCAGCCTCATCCCCAGCCACACCCA 
PPCHYPTQPPRPQPHPQPHP 

310 320 330 340 350 360 

■ I I I | | 

TGCCCGTGCCAACAGCCGCATCCAAGCCCGTGCCAGATCGAATTCAAACCAAAGCCAAAG 
CPCQQPHPSPCQIEFKPKPK 

370 380 390 400 410 420 

• I I I I | 

CCGAAGCCAAAAGAATTCCTGCAGCCCCTGCAGGGAACCTGCGGCGTTGGCAGCACCCCG 
PKPKEFLQPLQGTCGVGSTP 

430 440 450 460 470 480 

11 I | | , 

ATCCTGGGCCAGTGCGTCGAGTTTCTGAGGCATCAGTGCAGCCCGACGGCGACGCCCTAC 
ILGQCVEFLRHQCSPTATPY 

490 500 510 520 530 540 

1 I I | | | 

TGCTCGCCTCAGTGCCAGTCGTTGCGGCAGCAGTGTTGCCAGCAGCTCAGGCAGGTGGAG 
CS PQCQSLRQQ C'CQQLRQVE 

550 560 570 580 590 600 

I I I I | I 

CCGCAGCACCGGTACCAGGCGATCTTCGGCTTGGTCCTCCAGTCCATCCTGCAGCAGCAG 
PQHRYQAI FGLVLQSI LQQQ 

610 620 630 640 650 660 

1 I I I I I 

CCGCAAAGCGGCCAGGTCGCGGGGCTGTTGGCGGCGCAGATAGCGCAGCAACTGACGGCG 

PQSGQVAGLLAAQIAQQLTA 
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